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s iiidusfrlally relewarit yieids, they may Is^ prone to various types of degradatiofi aridfer imtsfoifes. 
The present Ifivsntion pra^?!des methods Ipr produping^ s by expressn^^g ther^i as 

pmtsasf^ eompil^lns a hfteroie^ous is^o-^sequ^ce. The fhvsntlph furSiief provides the rsstiiing 
proteases coi«pHsi«g such heterobgous |3rosequeiices> 

The present inv^nlior? rdales to isofated pofyp^ptiiSeigE having jprol^ase activity r^latsd 

10 to % Nae$fxihpss& $ii. protease, and isolated niiclele add se<;?uences encsding syoh protease®, 
Tt^# isiventloh furtliermore relates to nucleic aclcf (^oiistruii^s/vectoj^i, and hosi e#ils compryr^g 
these fiudelc acid ssqu^ce^ as well as methods fer produdKg aruil using the proteases, in 
p8ft cuifr #ithih animal feed. 

Polypaiptides having protease acti\?ity, or proteases, are sometirnes also designated 
peptidases, protelnaseSj fseptid© hydrofes®«, or proteoiytio en^yttses. Protessss may be of the 
exo4ype that hydroSyses peptides starting at eltlw end thersot or of the endo-type thst aci 
iMemally in polypeptide chsins (encfopepticiases). Endopeptkiases show activity on and C- 
as tenrninslly blocked peptide substrates that are relevant for the specifidly of the protease in 
question. 

The term -'profease' Is defined herein as an enzyme that hydroiyses peptide bonds, it 
indudss any enzym« belonging xo the EC 3,4 eriayrne group (including each of th© thirteen 
■ subclasses thereof)- The SC number refers to ghsyme Nomeodature 1992 torn NC-IUBMB.,. 

as Aoadertiic Press, Ssn Diego, California, including suppiemenfe 1-5 published in Eur, J. Biochem, 
1994, 223, 1-5; Eur- J. Biochem. 1995, 232, 1-6; Eur. J. Biochem. ISM, 237, 1-5; Eur. J, 
Blochem, 1997, 260, 1-5; and Eur, J, Bldchem. 198S, 264, 61<HBSG; respfctively. The 
nomenclature is regularly supplemented arid ypdaled; see e.g. the Wortd Wide (VVWVV) at 
httD://%ww,chem .qni w.ap..i|j/!Ubm 

m US patent pobiication Ho. 2002/01 82672A1 disdosss, that If one dr tvsfo c?f the last two 

amino adds at tha £>-tefminiJs of a polypeptide Is/are charged poiar: D or £ (negatiwly charged) 
or K R, Of H (posiitiveiy djiat-ged), tfje tall would foe considered pdar, charged, and th?s makes 
the poiypeptide r^lstanl against proteolytic degradation lay a satjoiass of proteases that 
recsognsie dorv-poiar C-^tenTsinal taiis of sesreted pretelns. 
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Another siscloisyf© reported, that prdlne ?^ldues at the C:«terminus Of nascent 
polypeptide chains Induee desfas^^tSon of the polypeptide (2Q0^. PmM residuigs at the G 
ta?mihus of nascervt chains mduce SsrA tagging during translation tsrmlnatton, j.B!al,0hem, 
27^:33829-33823). 

$ 

mmmAm ofthe inventioh 

\l m a weWnowii pfotJleirn m the art of ek^re?;slng poiypeptldgis having prpteoi^e aqtlvl^s 
that m^y of sucl^^ pols^ptides ar© Inhereiitl^ anetable; th^y rf?ay be subjec^t© aut^pmteeHysfs, 
pf fey rhay be tarsieted fpr is*^mda8^^ by ptiner prc^asef ^re^dy dafing their preducfissri, 
10 fssiiKIng in subroptfmai yteids; Many other factors coritrlbute to their mstalbillty, not si! of 
#5|ch are yndeii'i^ppd alprs^seni li fe of gre^t Intsri^tto pFOVl# prpteoiyt^ pp!y|3ej;H^^^ 
Increased sisbilfty, may thas be prsKJucedio higher ySelds. 

$^eted prdi;#a$es of the; and/or S1E sSaissl8sat?pp, often have a pro^r^glon wHfeh 
is deavsd fmm the protease id presdaoe tie mature part of the profuse, This present 
IS Inyentors have foufid, that prpdactldn pf S2A ahd/or S1E pro1^a$es §s fusipn polypeptides 
G&mprisirsg a tsaterobgous pro^mglorE results m much Iniprovsd yields when compared with 
prMudtion of the unaltered wlfd%p0 f>?oteasss. 

Accordingly, in 8 first sspsd tl-ie invention relates ts a secreted mature polypeptide 
which after maturation has protease activity, which polypsptkie when expressed and tsefore 
20 meluration comprises a heterologous pro-region and wploh polypeptide; 

(a) comprises an amino acid sequenoe which is at least 70% identical, %, or preiigrably 
75%, 80%, 85%, 86%, 87%, 8B%, 89%, 90%. 91%, 92%, 93%, 94%, 95%, 96%, 
97%, 88%, or 99% idsntical to the amino acid sequence of the mature part of the 
polypeptkJs shown in SEQ ID NO: 28; SEO ID NO: 33; SEQ ID HQ: 3?; SEO ID MO: 

25 41 : SEQ ID NO: 43; or SEQ ID NO; 45; 

(b) comprises an amino add sequsncs which Is at isast 70% identical, %, or prcferahly 
75%, 80%, 85%, 86%, 87%, 88%. 89%, 90%, 91%,, 92%, 33%, 94%, 95%, 96%, 
87%, 98%, or 99% identlcai to the ariiihis acild s^uenc© Of the mature part of the 
polypeptide encoded by itie polynycleofe ln Si^^ 1; SgQ ID NO; 2;, SeO iD 

30 NO: 2S; SEQ ID NQ; 31; SEQ ID NO: 32; SEQ ID NO: 30; SEQ ID NO: 40; or SEQ 

ID NO: 44; 

(c) opmprisss a mature part whieh Is a variant of the matune paft of the polypeptide 
has^hf the amino sequenos of ^EO ID NO: 20; SEQ ID NO: 33; SEQ 30 HO: 3T; 
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SEO SD ftl0r 41; SEa ID ND: 43;: or SE<S 10 NO: 4S, s^ment conipnsing a 
sul>stM<«:i, del©tlf?n, «s#nsl6fti ^n£|/Q^ iM^ftiqn of ones csir morss #niiilo iods; 
(d) Is ef}09dS(S fey « nuotefe acki ses^u^nse wftfeh h^*bri£ikss unsjer very iow, l^rn^ 
mecliurn^ow, medium, me^lurrhhilghi, Ng^, OFwr^j^igbstrMgerifCjirc^ji^^ 
$ 0} a poipueteoMe enoicling the matyra part of a jsretssM, said 

polynudeotide oblairsable from genomfp BUA from TVoe^/opsIs 
iimmmli&l siitap. dasmnifMi^ DSM 43235 by use of primers SEO IB 
Npk 26 and 27; torn Nocmti^opBkSta DBM 15047 fey use (Sf poitnefSi 
$B0 ID 34 and 3S; fern /Voc^/tif/ppsfs DBM 1S64S by i:ise of 
ID pfimefS SEO JO NO's: 38 m4 38; or torn Noea^fop&(s imma PSW 

1 by MSfe of pnmsis $EQ i® M0%t4? and 30; 
(0) the psslynwefedtids of SEQ 10 N0: 1^ of SEQ iP 140; 2; of SEQ \D NO; 25; 
of SEO fP ISiOr31; of SEQ m NO: 32; of SEO !0 HQ: 36c df SEQ iD NO: 
40;^ of S€Q 10 NO; 44; 

16 0n) a subsequence of (i) (II) &f &l less! SOO nupSedtid^, pmferabiy 400, 

300, 200, or 100 Jiucieotifies, or 
(IV) a connptementsfy st!-and of (f), (fl), or 011); 
(e> ss art altelc variant of (a), (b), (c), or (d); or 
(I) is a fragm«rtt of (a), (b), (c), (d), or (f). 
20 Iri a second aspect, the invention r-siates to an !Sv"slated polynucteotide encoding a 

poiypepllde as defined in the first aspect; or an isolates polynucteatide encoding a fcsion 
poiypepilde v^iiioi> after maturation has protease activltyf which before maturation comprises a 
heterologous pro-region, and which polypsptlde; 

(a) comprises an smino acid sequssnoe whicb is at least 70% ielenticai, %, or preferably 
2S 7B%, 80%. 86%, 86%, 87%, 88%, 88%, 80%, 01%, 92%, 93%. 94%, 98%, 96%, 

97%. 98%, or SS% identical to «ie amino add sequence of the mature part of the 
pclypeptide sitown in SEQ ID I^Q: 28; SEQ JD NO: 33; SEO ID NO; 37; SEQ ID HO: 
41; SEQ ID NO: 43; tsr SEO ID NO: 4S; 

(b) comprises an amino acid sequence wElch # ieasit 7Q% id^ntlbaf, %, or preferably 
39 7S%. 80%, 85%, 86%, 87%, m%, 89%, SG%. 91%, 92%, 93%. 94%, 95%, 96%, 

97%, 98%, or 99% Identicarto the amino acM saiqu^^ matym past of the 

polypeptide encodgi^ by ths pblyntfeleotlde in SEQ ID HQ: 1; SEO IP NO: 2; SEO IP 
NO: 2S; SEQ ID NO: 31; SEQ ID NO; 32; SEQ ID NO: M; SEQ IP NO: 40; or SEQ 
IDI«0:44^ 
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(s) com prises a m;stur@ part which is a yanarst of Ih© rnatum prt of ti§ poSypsptMs 
having file smms acsd seB^uenc© SEQ ID NO: 28; SEQ ID MO; S3; SEO ID NQ: 37; 
BBQ ID NO; 41; SEQ ID HO: 43; or SEQ IP HO: 45, the ssgp^m osmprising a 
sMbstiisifpn, deietipni escii^nsiori, and/or Insiarfion of on® of mors srtiisio assicls; 

(d) }s sjicod^d by a nyslefe acid ssq^ne® wNoh hy^jndizais under vary tow, tew> 
m^lum-loWi medlMm, rnediunvhigii, Jifgh, of ves^ f^lgh strfrsg^noy cor^diilorjs with; 

|l) a jpslynuGieotide fenedding the mature past of a protease, asid 
polyjiuclsotide obtainal)te from g^Romto DNA 1mm NociWdi&^sfs 
cfaSv^nwfM syfcap, dmmnviiles DBM 43235 fey um cjf primers SEQ ID 
NO*s; 20 and 27; torn N&csfdiopms Mba P$IM 1§647 fey use of piirties^ 
S£Q ID N0%; M aiid 35j from Nomrdiopm pmms DSM 1 fey use of 
primers SEO ID MO's; 38 and 3i; or from Noeardic^m prmim DSy 
1Sg49 by use <3t prim^ SEQ ID NO'^; 42 ar?d 39; 

<l!) me poiynuoleotide of SEQ ID NO; 1; of SEQ iD NO; 2; of SEQ ID HO: 25; 
of SEQ !D MO: 31: of SEO 10 NO; 33; Of SEO ID NO; 36; of SEQ ID NO; 
40; w of SEQ ID NO: 44; 

(ill) a subsequejisf of (I) m {U) of at least 5CH) nucleotides, preferably 400, 
300, 200: or 100nucleoi:(dss, or 

i\V} a complemsntary strsnd of (i), (ii), or{!l!); 

(e) is an alelic vanant of (a), (b), (c), or (d); or 

(f) is a fragrrsent of (a), (b), (c), (d). or (f), 

SliSl. in a tl>ir<i aspect, the invent'son relates to a recomi>lnsnt expression vector or 
polynudeoiicis construct con-^prising a poiynuclisatkie as deilned in ti^e previous aspect 

Yet a fourthi aspect reiatss to a recombir^ant host celi comprising a poiynucieetide as 
definsfd in the secottd aspect, or ai? expres^loi'J vector or poiynucisotlde coi>struGi; as defioed in 
file previous aspecf. 

In a fifti^ aspect, the ln\^t'e^.'^l$Q:i^l^i^ fe>.a-lr^iB8|Nr^e.p^nt or plant ii^ft, comprissng 
a poiynucieotide as defined iri the second aspect, or an exprsssion smdim or polyrsueleollde 
constot as defeed in the ihM aspeol, 

Tbs sixih as^pecrt of the Invention r^lsfes to e tans^erilc, nori-human ahirnai, or products. 
Of <sf^«its IberBof, comprising a poiynucileotide as d^rsed iii tNe second aspect, dr an 
expression vector or pblyrsyPleotide cortstfuot as iSeftr^ in tHe third aspect 

The seventh a^pest of the invention rdates to a method for producing a poiypepiidb as 
dsfiiied in flie first a^p®«* i ti^e method comprising: (a) cultivating a recombinant tsost cell as 



4©flnM in thfefeyrth asped, w a trajBfsnfc pisnt or animal as defined In th^s Sit) or sMH 
sspectSi to produssf a ®upaf pt^t compHsfng th© poly^^ op^ior^aNy p) rt€»^yer?f^ the 

polypfep^cl©. 

Oth^r aspects of then iEisyentlCJti rgitete to; an animasj tei adcfilivs comprising at l^a^t on^ 
S poiypepde as deineii in the first aspect; afsd 

(s) M isaslpn^fat-soiybis vitamin, and/or 

ieasi en© wat^r-sbiubi® vitamin/and/or 
(p) at li&astfiim trace mbemi; 

animal composlticsn having a pru^ pretein content pf SO to 800 g% and 
10 c«mpf Ssing at least one poi^fpeptida as ilefi tied in jla flr^ asipect, m at iem^ pn® IseiEi additive of 
W& previous a$ps<?t; 

a sompdsltfon csompHsing at isast one pdypep^lde as deSned in tlh® llf^ sspact, together 
with at fe^st ene pttssr m^me ^efesfed torn amPfigst phyt;ase (EC 3.1.3,8 O!" 3.1,3.26); 
xyianase (EC 3;2»18); ^afeotanase (EC 3.2.1,89); aipt^a-gaiactosidas© (EC 3,2,1.22): protease 
If (EC 3-4,-.,-|, phospiioSipas© At (EC 3>1 .132); phosphof jpas® A2 (EC 3.1.1 A): lysophospliolipase 
(EC 3.1.1.5): phosphoiipas© C <3.1.4.3); pho$pho?fpasa .0 pC 3.14.4); ar^d/or bsta-^lucanase 
(EC 3,2.14 or EC 3,2.1,6); 

a methocf for ussng at S-sast one poiypsptide as deiined in the fimt sspsct, fbr ii'f^proving 
the nutntional value of an animal feed, for increasing digestible and/or soluble protein in animal 
20 diets, for inorsasing the degree of i^ydroiysls of pmtesns in snimal diets, and/or for the treatmsnt 
of vegetable proteins. iiie method compfising including tt^e ppiypeptide(s) in anlaiai feed, -andtor 
In a oorrtpositlon for use In animal feed; 

a metf-sod for using at least one peiypeptide m deftr^ed m the first aspect, comprising 
including the pPiypspicie($) in a detergeM foji'nulation. 

25 

DETAILED DESCRIPTION OF THE IMVENTIOM 

Proteases are dassifiad on tiie basis of fceir cataiyic mepftanism irSo the foiSpwing 
groups; Ssrine pmtsesas (S), Cysteine pi-c$eases (C), Aspartio prcsteasess (A), yetalloproteases 
(M), and yriicnown. o,r as yet yncfassKi^, pra^taam (U), see H^nd&dek of PMedl^^^^^ 
m A:.J.Barrett, N.D.Ravs/iings, J.F.Wosssner (eds), Acaderhic Presss (1098), in partiPblar the generai 
intrpdustkin part. 

Serine proteases am yblqufteus, bmn§ found in vinise^j feacterla &M euRaryotes; they 
inelude assiepe^idase, endopepSdaeOy oSgopeptidas© and pmega«f:^pt|dase acivity. Oyer 20 
faitnlfe (denotad SI - S2?) of sarMe proteases have been Identified, these being grouped into 6 
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GMm^mm^ SA, SS, se> SE, SF, aid S<i, the basis of structur^i s!misr|y and fenctsonai 
evidence (BafTetl ^ §1, HandbCKik of p?ot0d^ytfe ens^me^), Stfu am taown for at 
four of the cians (SA, SS^ SG «nd $E), thes®! appear to be tolaiiy urirf^atsd, suggesting at 
least four evoMlpnary oflgfns of serine pef^tl^aseis. Aipha-lps endop6|?^da$s# beissritg t<> th^ 
& ehyrtnotrv^isin (SA) dm, within wNdi they have bmn assigned to subffam^f A of the S2 fmi^ 

Afiotiisr classlficate systern of prelesiytfo ensiifmes is based m ^^mm mfomsMQn, 
an<i Is iiieresfore used mom often in the art of moiecular bloiogy; it is desolfee^ Rm^iihp, M.D, 
et St.. 2002, MERORS; The protease dat^Nse. Nyctete Adds Res. ^;343-346. Tire MEROPS 
10 databas© is teeiy ayaitafcie elecfeonlpally at hto://s/jfww.merops.aa Ac«of^|ng to tha 
NIEROPS system, the proteoiytiei efSSy^e^ d^ssifsed as S2A in The Hans^Ssooi^ cf l^fOteoiytfe 
Enz^ss*, are in ^EROPS da^sifi^ as ■S1H' proteases {Rawlings NO, Bafrett A-i, (1S83) 
S'^oiutiphary families of pi^jiMases, Slodhsm, J, 2;80;205^»21 8), 

ih partieular erribodfmersts, the proteases isf ^e invsnion and for use according to the 
is fey^ntbn #^ sebeted from the ^3^oup consis^fjg df; 

(a) proteases foeionglng to th@ EC 3.4.-.- e?5zyme group; 

(b) SeHne proteases belonging to the S group of lfte above Handbook; 
(ci.l Sedne protssases of peptidase fair^lly S2A: 

{c2) Serlrse prot^tasss of peptidase famlfy S1E as described In Blochem.J, 200:205-218 
20 (1893) and in ?<,1ER0PS a proteas-e database, release 6.2G, March 24. 2003, 

(ww/.r^ierops.ac.uk). The database is described in Rawflngs, N,D., O'E'lers, E. A. Bt Bam% AJ. 

{ZQQZ} MEROPS: the protease database. Nuctelc Acids Res. 30, 343-346, 

For detefmsnsng wljether 3 given protesee is a Serins protease, and a family S2A 

protease, refereoc® is made to the above f-iandbc-ok and the principles Indicated there-in. Such 
25- determination can he carried out for ail types of proteases, be It naturally occurnng or wild-type 

proteases; or genetically engineered or synthetic proteases. 

Protease activity can be measured usln§ sny assay, In which e fobstrata Is employed, 

thatlnclud®® peptide bonds relevant fof the spediclty of ^e pmtease irt qys^^ Assay~pH end 

assay-iemperatufe are likewise to i?e adepted to the i:s'oti^s^ irs question, Exairiples of aesay- 
30 pH-^^aipes are pH S, 3, 4, 5, 6, 7, §, % 10= 1 1 , or 12. Examj^es of assay^ien^peratures are 30, 

3S, 37, 40, 4S, m> 56. 60, BS, 7S, 80, 90, or 0S*C, 

Examples of protease substfates arfe easein, such as AKudne^^rossiinkad Casein 

{AZCboasein). Two protease assays are desoiltssd in Example Z herein, eitier of whi#5 can be 
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is S pr^rrsd assay. 

Them am m Sfmitailsns on the origin of tiis pratsase of th& mm€on and/or fer use 
aocordsng to liia liivention, Thys, thsi term protease inclu<jes tidt mlf mpdrM m mM-%ip& 
s prc^eases obt^lnesl fforn HifCFOOfganfems of aiy genus, but ateo mf matsn^ varlanfe, 
fragments etc, thereof e^lbitfeg prote^^^ aepyity, as smW as synthetis prsstesses, sush as 
shuffled pM00m, and cehs^nstss prdtei^es, Sueh pftsi^cisliy eh9lne©f®d |>mteasee cssi be 
jsrepamd as is gersarajiy known In tiie aftj e§ b)? Sfte-d|mctad Mutagsnss|s, by PGR (using a 
P0R fragment eGhtajfiiiig tfie clesirec! mt^tMion ae prse the pnfrifm in the FCB msGti^ or by 

15 Random Mutagenesis. The pmpsrafion of consensus prolelns is ^^prSbied in eg EiP 

The term ^dbtaioed freit?" a^ used Hew In ooi^nedfeh with a given sownse siMI mean that fee 
polypeptide enecsded by the nucleic acid seciuense is |!sroduoed by fes soures or by a ^1 In 
which the hudeic aold sequertbe from ihe sourfee is preseht in a j?refemgd iembodirnent, the 
polypeptide is sessretec! ^tfaceliuiarly, 

16 !n a spedfio embodiment, the ijrotease ife a loypilergenic variant designed to Invoke a 
reduced fmmunbiogicai resipohse when exposed to animals, inciudsng mm. Tiie term 
immunotogtcal response is to be yndetstood as any action by the immune §ys^®^ an animel 
exposed to the protease. One type of immunoioglcai r©$pO!ise is m allergic response 'leading to 
sncreassci ieveis of IgE in the exposed ar^lmaK lovv- allergenic vanants may be prepared using 

20 techniques known tn the art. For exampse the protease may he conjugated with polymer moietiss 
shieldirfg poslions or epitopes of the protease invosved in an Immunologicei response.: 
Conjugation with pslyniers may invQive in v/fro chemical soupiing of polyf?-5er to the proteaBe, 
e.g. as described m VVxO 86/17929, WO 98/30682, WO 9a/35026, and/or WO 
Conjugation may in adclltlcfn or aiternatfvsiy thereto involve in vivo coupling sf polymers to the 

2S prolsase. Sudi conjugation may be achieved by genetic englns-snng of the nucleotide sequef^se 
srscodlrsg the protease, inserting consensus ssquences encoding additional glyossyiation sites In 
the protease and e,xpressing tiie prateas>s in a iiost capabie of glycosylating the protease, see 
e.g. Vv'O 00/26354, Another way of proviaihg lov»/~ailergenis; varisftts Is ger^tic ongitK^ering of the 
nucleotide sequence enoGdirjg the pratease sss as to j^usethe protease to self-Qifsomerke, 

so efeciing that protease hiohomers may shield the epitopes of OBier prtf^ease fhonomsns and 
thereby lowering the antigenicity of the; digooiers. Such products and their preparation Is 
descnbed e.g. in WO 96/1 $1T7, Epitopes Involved in an immunological response may im 
identified by vanous metliods sudi as the phage dtsplay method described In WO S0/2S2S0 ar^d 
Wd 01/83BSE or the random approach desa-ibed in EP 661907. Onc& m epitope lists been 
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MmW^yM wmm M&id sequepjee may fcs® alterad to prodtjcs afeed ln\mun0foplea! properties 
0f the protease by known gena manipylaijon techrif^^^ dsfacled mut^gen^is 

WO 00/26230, WO ans^^or WO 00/22103) and/or oonfugate of a polymer may i>e 

dcso6 if) sufefes^f proximity to the epltopeforfisepol^^r to sN«ld tte^pSipps, 
S Tfi^ first aspect of tie invention felatos to ^ secreted mature pdpeptids whfeh afe 

matumtlon lias proitea$e actfv%, yMck polypeptide when expressed ^nd isefore maiumfe 
cdrripnsss a hetofologow^ pr^^ 

(a) compnses ars amino acid seqyefios whleh Is a* least 70% Menlisai, %, or prefembfly 

75%, m^, m%, 8e%i 87%, p%, S9%, mm, 01%, 92%, 93%, 9s%, 9s%, 

IS §7%, , or 9S% Iderifcai ;to the efrslnp acid $siqusr)ce of tile matur® psri of the 

polypeptide ^mn m SEQ ID HQ: 2^; IP HO: 33rSEa ID NQ: 37; SiQ ID I^O; 
4i ■ SEQ ID NO: 43; or SEQ !0 NO: 45; 

(b) comprises #11 ^mlno ^pld ^esiu^nce «^ieh is at least 70% identical, %, of prefefabiy 
7S%, 80%, 85%, BB%, 8?%, ©8%, 89^, 90%^ 91^, », §4%, 0S%, 96%, 

ie :97%, 98%, or 99% identical to the amino ®cl4 sequence of the msfure part of the 

poiiipeptide encoded by the polynuc'sctlde in SEQ ID NO: 1; SEQ ID NO: 2; SEQ ID 
NO: 25; SEQ IP NO: 31; SEQ: IE) NQ; 32; SEQ ID NO; 36; SEQ m nO: 40; oj SSQ 
ID NO; 44; 

iQ) c-ompfises a mature part whicii is a variant of the mature p:^rt of ti~!S poiypeptide 
m havsng the amino acid sequsnce of SEQ ID MO: 28; SEQ ID NO: 33: SEQ SO NO; 37; 

SEQ ID NO. 41; SEQ \D HO: 43; or SEQ © NO; 45. the segment comprising a 
substitution, deieticsn, e;<t8n$iori, and/or insertion of one or more :^minQ acscts; 
(d) is encoded by a nucleic acid sequence wi~sich hybridises under verv low, low, 
medium-low, medium, msdium-high. iiigi-s, csr ver>' high stringency ccnditions with; 
2S: (I) a poiyrsucieotide erscalng ti>e mature part of a protease, said 

polynudeotide obtainabie from genomic DNA from NocaixfiQpm 
dmmnylM mbsp, dassQmM&f DBU 43235 by Use of primers SEQ fD 
NO*s; 28 md 27; t&m NocBfdiopsf& Alba DBU 1SS47 by use of primsrs 
SEQ flD N0'$; 34 and 35; from Nm&f&iop&b pm^m DSM 1 5648 by use of 
38 l^^msrs SEQ ID NO*s; 3S and 39; &r from Nocai^icpsls praMm DSM 

15649 by use of primers SEQ ID HO^: 42 and 39; 
|lf) tit© polynucledtids of SEQ ID NO: 1; of SEQ ID NO; 2; of SEQ ID HO: 25; 
of SEQ ID NO: 31; of SEQ ID NO; 32; of SEQ I D NO: 36; of SEQ ID HQ; 
#; or of SEa MO; 44; 
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{III) a subsgqu^nce of (I) or p) of at tea$t SCiS nudeo^des, preferably 4G0, 

300, 2C0, pf 100 nucteotld^s. <>r 
pv) « comptementary str^d of (i;^, (H), or {|SI); 
{&} m m alfefte yarisnt of (a)> ®, (c}, or |c}); is^ 
a (I) isaffagrnef^<rf(s)* (tj)^ (d), or (f). 

For th® pufposes of the present tnventevlliei degrs^ identity betw^er? two amifto as^d 
sequ«ns«s, as well ss the clegr^^ of iclentty |3etweetii two nucieoSde sequensss, !® ietemiifKfed 
by thd pr<5gram ^ilgtf ' wNeh is a Needteiman'-Vyunsdb aljgnmesit a gbfaal aiinnias'st), the 
program used lor Mlgnmrnt of pcilypeptiile, as as nudec^is^e s#qiiu«m;«s. The dsfaolt 
1Q eoortng matfti BIOSUMSS is yssd for polypeptide alignmgnis, and tse <ieifau!t identity mafeix js 
used for nucSeotids altgorrients. The jseiially for first fssidue of e gap i® -12 fer pij3yp8|5t|d©§ 
and -16 for nudeotides. The penalties fs^ further residue of a gap are -2 for polypeptide, and 
■4 for f^udeotide. 

''M0f b part of tse FAStA package versfdn v20uS W. R- Pearson and D, J. 

1§ Ufnrsen (1S£i8), 'liriproved Todis for Biological Sequence Aiiaiysls", PNA8 as;2444~2448, md 
W, R, Pesrson (1890) "Rapid and Sensitive Sequence Compailson wfili FASTP and PAST A," 
Methods in Erszymoiogy :183:63^98). FASTA protein ailgnniente use tiie Smith-Waterfnsn 
algorithm vs^th no imitation on gap size (see "Smsth-VVatemian aigoritlinf' , T. F, Smith and B. 
Waterman (1.981) J. Moi. BioL 147:19S~1S7). 

20 The degree of identity batvsfeen two amino acid sequences may aiso be detsrniinsKi by 

the Ciustal metiiod (Higgins, 1989, CASiOS 6; 161-1 S3) using the LASERGENEf^' 
iVsEGyMiOM^' so%/are (DNASTAR, Inc. Madison, VV1) wifn an identity taijie and the fo,towing 
muftipSe alignment parameters; Gap penaity of 10, and gap length penaity of 16. Pasrvvise 
alignrnefTl paramsters are .Ktupfe~l gap pe.nalty~3, windows-S. and dia9ana!$"6. The degree 

'28 of sdsrstily between two nucleotide esquences may be determined using the same algorithm and 
soitware paci(age as cieecribed ai^ove xf^-itii the foiiowirig settings; Gap penalty of 10, and gap 
length penalty of 10, PalrvsJse alsgnmsnt parametore are Ktuple-3> gap psnalty^S and 
windo'A;s~20. 

A fragn«Jt of one of the encoding poiynucteoMe saqyenoes of tte Invention i^ ^ 
m psiynudeotlde which gnoodes a poiypepMs having one or rhore amino adds deleted from the 
ammo and/or carfooxyi ^fTnir?ye compared to the fuiNength Brmm acid se^u^oe. In one 
embodirt)eht a fi^gmsnt enoodee at least 7S amino add residnes* or at least 100 amino acid 
residues, or at Ifeest 12S amino acid residuas, or at l^ast ISO amtlno aoid residues, or at leasi 



160 aitsin© acid rssldues, or at teasi 165 amino add msldues* or st 170 amlo^j aci?i 
residues, or 3t least 175 amino adl^ resi#m 

An ailslic variant d<sndt©s mf^ of Wo onser© aitemative fomis of a g©n« occupying t^s 
ssme chfeniosornarioeus. AJIelic yanafion anses naturally ^ro^igii niiiaHon, asid may mstilt m 

S ^>cifymofpNsm "ij^hl^^ po^latlom. Gane mutations can be sliant (n«5 ci^ange in ft?® encoded 
l^lypspii{te) or may enssoda poifypeptidas havkig altered &mm acid sequ^enees*. An alisfc 
mmr^ of ^ jsofypeptida !$ a poiypptide encoded by ap ailelic variant of a gena 

Tli« presant inyehtion also relates to isolatiSd p6?ypapM©s having prptaase activity arid 
v#?idft am ©f?codad by nuclaio acicl js^qusiicas whi0^ und«r vsjy f6% or low, or iew- 

10 madfum, madium, madiumihigb, higii, or very high sUffngency cprsditionis wllh a nuclele ac^d 
profee wbloh hyfoddi^s under tha saim cdhditlons with (1) a isplynucleoiJda encoding tiio mature 
part i5f a prtjtaase s^talnablb frorn genomic DNA imm Nomr^lo^sis xiasmn^lHef sabsp^ 
dBi^mMiD^M 432iS l?y use of pfirrtatB SEO IS NO's; 26 and ZTyftem M6cmdl<ps!8 Aiha 
DSM 16647 by ijse of pnmars SEQ ID f^'s; 34 and ZBii?<m Nmaitihp$m pmslm PSM 1S648 

15 by use Of pflmer$ SEa ID'NO's: M md M; or from Afo^«^^c? p^Bs/na DSM 1 6649 by use of 
pnmors SEQ !D NO's: 42 and 39; (i!) the polynudeoMe of SEQ ID HO; 1; of SEO ID HO: 2; of 
SEQ ID NO: 25; of SEQ ID NO; 31; of SEQ iD NO: 32; of SEO ID NO: 36; of SEO ID NO; 40; or 
of SEQ §D NO; 44; (11!) a subsequshce of (i) or (II) of st ieast 500 nucleotides, preferably 400. 
300, 200, or 100 nucleotides, or (IV) a cornptements5ry strand of (I), (Si), or (III) (J. SafT5broo]<, 

20 E.F- Fritsch, and T. Mansstis, 1989. Mohcalar darmg. A LaboiBfory ManaaL 2nd edition, Coid 
Spring Harbor, Nav^ York). In orso particuiar embodiment the nucteic acid probe ss selected from 
amongst the nucleic scid sequences of (a), (b), or (c) above, A polynucleoiide corrssponding to 
the mature peptkie encoding part of SEQ ID NO; 1 SEQ ID NO: 25, SEQ iD NO: 31, SEQ ID 
MO: 32, SEQ ID NO; 36, SEQ ID NO; 40, or of SEO ID NO; 44, b a preferred probe. 

;2S The nusieic acid sequences of SEQ ID NO; 1 , SEO ID NO; 25, SEQ ID HO: 31, SEQ ID 

MO: 32, SEQ ID MO: 38, SEQ ID NO: 40, or SEO ID NO: 44, or a eubss€|uence thereof, m well 
as the amino acid sequences of SEQ ID NO: 28; SEO ID NO; 30; SEQ 10 MQ: 33; SEQ 10 NO; 
37, SEQ ID NO; 41, SEQ ID NO: 43, or SEQ ID Iv^O; 4S, or a imginnsnt thereof, sM mm a 
genomic polynuiMaotida encoding a prptaass oMalrtabl© tern g^nbmls DHA itm NogaMhpm 

30 dm$6i^W^0 subsp, dmB&nvim 4323S by use of pn'nws SEQ ID NQ's: 26 and 27; from 
Nm9l0opsis Aa mM 16647 by use of primers SEQ ID IIO'a; 34 and 35; frsm P4QCsrdbpsls 
pmsm& DBU 15M8 foy use of priniem SEQ ID NO's: S8 and 38; or frofrs iteses^^ 
PSM 1S84S l>y use Of primaf« SEQ1P NO's; 42 aid 39, or a subsequence thereof, may be used 
to dMsn a nyoiaic a^y probe tp ldei^^^ dom ONA encoding polypeptides baving protesage 
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act vity from stras?i§ of dfeant Sonera or species a<xo?^ling to JDeihqds wsil knowoin the art. In 
partloplar, such pmim can ise for hybridteatlors 

Sfjefejes of Inter^^t, felMving st^dsrxJ Sout^jsm islsttiiig pfocsdysw, in ostler to identify aid 
ii^late iie correspondrng gene fher^n. Sush probss can fee sort^idsmfeisf shorts ths« the entirs 

S sequence, slleuld fee M l^ast prefersfoiy at isaet 25, and more pr^sraisly at least 3S 
oucteotides In feng^b. Longer probes can also be ysed, Both DMA en<^ R^i^s^ profees can u^f 
The probes are typleaily la&eled for det^tlng the <?piT«spond!ng prie (im mmtipM, mih ^H, 
^®S, blotin, Of evii^in). Sueh pmbss ar® eneornpassed by the preseot iovention. 

Thiis, a gendrnte DNA Of qDHK ilbrary prepare frism suo^^ Qt^^^f orgahlsms may be 

19 scmened for DNA tliat hybrSdi?;es With th® probes deecnbed above and v^icSi eoGocies a 
polypeptide having protease aottvity. Genofnio or oilier DNA from such other or^ani$ms maij? be 
sepatat^ by agansse or polyaaylamid© gel elestropi^c^^is, or other s^aratlon tecliniques. 
tMA from ttse libraries or ttse separated DMA may be ^rssferred to and immobilized on 
iii^dc^liuios^ or oth^r stiitafei^s carrier matetial In order to identi% a ciofte or DNA which is 

IS homofogous with SEQ ID NO: 1 or a eMbsedjuenc© thereof, the csrrier matarial Is usod In a 
Sotithiern tslot For piirposes of the prBsent invenlfon, hybndkation f'ndfcates that the nucleic add 
sequence hybndizes to a labofed nucSeic acid probe coitespondirtg to tlis nuc4eio acid sesitienc© 
shown in SEO ID NO: 1, its complementary strand, ore subsequence thereof, under vesv Sow tc 
verjff high slringency conditions, Moiecuies to which 't^© nuci^Sc add probe bybridlKes under 

go these conditions are detected using X-ray f;Sm> 

For long probes of at least 1S0 nudeofsdes sn lengfn, ver/ low to vei7 high stringency 
dorsditlof^s are ciemwd as prei'tybnclfeation and hybridization at 42*C in 5X SSPE, 0.3% SDS, 
200 .ag/nii sheaf^ed and denatured saimon sperm DNA, and either 25% formamide for vary low 
and low strsngenoies, 35% fcrmamicls tor mediiim snd medlurTJ-lsigli stf^ng^3nc!es, or 60% 

25 formamide for high and von/ high stringencies, foiSowing standard Southern blotting procisdures, 
Fssr long probes of at ieast 1 00 nucleotides In iength, the carf ier matehal is finally washed 
three Smes each for 1 5 minutes using 0.2 x SSC, 0^2% SDS, 20% fdrmamids preferably at least 
at (ver^'- low stringency), more preferabiy at least at SO*C (low sthngsnoy), more prefesBbly 
at least at SS^C (medium stringency), more preferabiy at ieast at 80"C (medium-hlgh stnngency}, 

3D even rRore preferably at least at 6S*C (high stfingeftcy), and mod: preiferabSy at: leiast at 70*G 
(very high siringsncy). 

For shod probes about IS nocieotldgs to afeout TO rjudeotfdes iii length, stringency 
asnditbns are defined as prehybridfeetloh, hybridlzatida and VKashirig post-hybridii^ilGn at S°C 
to tS*C bebw the oalcyiated T;,; us!«g th© calculation aecordtng to Soltdn ai^d yoCarthy 



PtoG0&<Mrsg$Mth& Na0aml A0a<^&my of S&lmicB^ USA 48;139Q| O.S M NaGl* J3,09 y Tds-^'NGI 
pH 7,6, t my EDTA, O.S% IX Defthar^iS solution, 1 §odium p5?mfJJscispha% i mM 
soditjm raonobask; phosphate, 0,1 mM MtP, anci 0.2 mg of y^ast RNA per m! Coiiowing standarii 

S Far Short pmim about 1$ rjtJdetoSd^s to a^ut 70 mssiedS'des 'm iengt??, the oaitfer 

materia! Is washed once in 6iX SSC islip QA% SBB for 15 minutes and tw^qe each fm IS 
mlout0s itsini 6X S$C at S*C to 10*0 below the cgslouiated T»i^ 

The present )r?vef^on also relates tss variants of the ipofyp^istlde of the invesiilen, 
c€5fY^pfisl«g a sufestitiitfefi, dsietjon, and/or sns^rtiofi of or more arplmo ^ci% 

1(1 in a particular embodimsht, arnino add changes are of a minor nature, tlsat is 

eorjservi^iivs arnlho acid subst|tu^or» that # siot sigm^ and/or ^(Stly% of 

fee protein; small deletidns, ^Ically of on® to afeatit 30 amtno sm^s; tmnll amSmj- or carboxyK 
termlrsal Mentions, such a§ ah amfetsphai methioriine residue; a sitiaii peptide «f up to 
about 2S>-^5 Emidues^ or a small S)rf:onsfsn that facllat^ purlflcaUon by changing net charge or 

15 another fursetiprs, sueh as a poiy-hisldihS! tmdt, an a or a binding domain. 

Examples of sonservstlvs substiti^lons are ythln the group of basic amino acids 
(argmin©, lysine ar^d hl^Mlne), acidic arnind gtelda {glut^ic acid and a^partic acid), polar amino 
adds (glut^ine and esparagine), hydrophobic amino adds (lietidn©, Issteuclne and valine), 
aromatic amino acids (phenylalanine, tryplophsn and tyrosine), and sma^l amino adds (giycine, 

20 alanine, serine, tiireonina artd methisnine). Amino acid substitutions which do not generally aiter 
the specific activity are known in the art and sre described, for example, by H, Neuratll and R,l, 
Hiil, 1979, h'K Th& P!oie(ns, Academic Press. New York. The most commonly occurnr>g 
■exchanges are Ala/Scr, Val/lte, Asp/Giu, Thr^Ser, Ala/Gly, Ale/Thr, Sef^sn. AlaA/al, SsfVGly, 
Tyr/Phe, Ala/Pro, tys/Arg, Asp/Asn, Lmr/lte, UuA/at, Ala/Giu, and Asp/<3ly as well as these in 

2S reverse. 

In a particuiar embodiment, the polypsptsdes of the invention and for use according to the 
in\?entlcn are add-stabie. For tiie prssent purposes, the ten-n add-stebfe means that the residual 
activity aftsr 2 hours of incubation at pH 3,0 and 37^C, Is at least 50%, as compared to the 
residual actlvly of a corresponding sampia incubated for 2 hours at pH S.Q and S°C. In a 
30 particular embodiment, the fssidual activity is at iaasi 60%, 70%, 80% or at least 90%, 

In particular embodiments, the polypeptide of the invention m i) a bactsrial protease; is) a 
protease of the phylum AciinobaGiem Hi) of the dass AcUmmactem; Iv) of the order 
Aciimmymt&l^B y) of the family Mofraitib^^ v|) the geriy§ |^pGar#5^^^^^^^^ andl/?or a 
protease deHved from vll) NmariMopsis spedes ss.s;h as Noeaitiiopm a/fea. Namrdlopsis 
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mtMrotiGS, NocBfcMopsis prasma, mmpostst, Bxhatemy /j^opMa, fmhioMmnSf kumBmnm:s,^ 

xmpang&mh, «r Nomr^ii^m d^ssptmSm for sxampie Aloifa/tfopsls <km}f}vW&iO^%^ 4323S, 
The aboy© tassonomy Is a«;cordlng to ths chaptw: The m?ic| map to the IMsrsasi % GM, 
s G^ririt^^ J. 8, Hoit in Betsey's yanuaf of %^tfmat!c Baeterioiogitfs 2001, second editsoft, 

volume 1^ Dsvid R Bone, Rschard Castenhofe. 

It tMS fee un?Ser#lsqd that lor the aforerhfefitioneci speeiss, the IhvrfenSorv ^ncompsses 

both th© peifsci and impeifecl states, and other tsxortomic :^uivater^s, f#, anamorphs, 

regan^less of file spscies ntame by which the^ are kn^mn. Jfm^ skiiM In the art w^Si madiiy 
1 Q recognize ttie identity of apijroprlafe equlvitersts. 

Strains of these species sm i^^adily aceessilsls to the public fn a ntimlser of odture 

coliectfohs, mtti as the American Tj^ Culture Qoflection (ATCC),; Deufeche Sammiung von 

Wlkrpo?^en!sitien uhd ^^Ifls'^ft^^^n 

and Agrloaitura! Research Seiviee P^imt Culture Coliectioh, Northern Regional Eeeearch 
1S Center (NRRL). E.g., Nomrdio^^fB dmsonviMl sUbsp. da^konviHel D^U 4323® is pufeiiGly 
available from DSiVIZ (Deutsche Saryimiung von ^ikrodrganlsmen und ZeUkultufen SmbH, 
Braunschweig, Germany). The strain w^s also deposited at other deposi^ry instltMions as 
follows: ATCG 23219, fiVIRU 12S0, NCTC 10489. 

Furthermore, such polypeptides may be Identitled and obtained from sther souross 
£0 Induding microorganisms isossted iron- naiuxB (e.g. , soii, composts, water, etc.) using the abovs- 
mentioned jorobcs. Techniques for isoisting friicfoorganisfns ircm nstura?! habitats are well Known 
in the art. The nucleic acid sequence may then be derived by simiiariy screaning a genomic or 
cDNA library of another microorganism. Once a nuciesc acid sequence encoding a pciypeptide 
ties beers detected with the probe(s), the sequence may be isosated or cloned by utfiizing 
m techniques wilich are foiown to those of ordinary skill in the art (see, e.g., Sambrsok &tal.,1 989, 
supra). 

As defined hemin, an "isolated" polypeptide is a polypeptide which is essentially free of 
other polypeptidss, e.g., at least about 20% pure, pr^erably at ieast about 40% pure, m&m 
preferably afeeyt ®0?<» pure, eyen more preferably abotst 80% pure, most preferably abot4 90% 
30 pure, and even niesf preferabiy aljout 05% pure, as determif^d fay M)S-PAQB. 

Polypeptides encode by naoiefc acid sequer>ces of present Invention aiso include 
fused polypeptides or dlea^i/abie fission polypeptides in which ahpther poiypeptlde is fused at the 
N-tertninus or the C-terrrtnus of the potypeptide or fragment thereof, A fused polypeplid® Is 
produced by lusiRg a nucleic add sequehce (or a poMion themof^ ensoding another ppiypeptide 

IS 



io n ftudele add sequence (Of a poftled ther^soQ si the presentt mvenion, Tsc^inlques for 
produGsng fusion polypsptlcle$ ar^ Ksiown in tj© sit, e.;g. PGR or ih& coding s«qyen?«s 
©jRSO^tog the polypeptides so that they are in frame Sfid that sjcpr^sfen of ilii^ ftissd pol|jmpt!(l© 
is Mndsf eontroi of the sani^ prctnoterCs) aiid terniirjatof. 
§ In present context, hon^poiar amirso aslds are Ai V, I., % M, F or W; and 

Uiichargssd poS^r amino scids gf^ 3, T, H, 0, y\ og C. The te$T«s "non-pqisf and *usii?;i^^rged 
pQ\0f wh0R US0^ to deserilse arnfno apids in ^ poi^iseptide; are generaiiy rseognixed in the art as 
cher^^iissfjg the side-chain of the amino acl<l> Hence, for instance, the tee carhaxylio^cicl of 
the p-terrnlnal ^mino aoid in a jpofyipep^llcfe Is hot considered when detentiinlng whafher thfs 

n amino aad is a n^ni-poSar or uncharged poiar amino add, 

P^p&l^tmd ehlbqdimentiBleatess to^a poiypejstfdeof thefiMa^pe?^ ifnature partis 
a wlidtype poiypsjstide; an artificfal vsnant of a wildtype poSypep^^^^^ sasd variant havsng one or 
rnof^; a!Tslno-'as;id(s) added to thf G^termlmis as compared tft the yidtype sind prefferabiy the one 
or more is(kk^ arnino acldCs) Is (are) ndn-pdiar or uncharge ahd mm more pr#erahly the one 

IS or niore added ^mlno acfd(s) J? orte or nw^ of Q, A, or P; a shuffleci peiypeptide; sir a 
protem-^ngineered pof ^'peptide. 

Yet another pr©far?^d arnfeOidl^ relates to a: polypeptide of the first aspect, whfcli 
comprises at least throe non-pofar or uhdiarged polar amino acids within the last fotjr amino 
adds of the C4®rminus. of the polypeptide; 

2Q. AmthQ4 preferred embodimesTt rsSstes to a polypeptide of the first aspect, wherein tlie 

one or more added amino acids are $electe<:i frorn the group consisting of; QSHVQSAF, QSAP, 
QP, TL. TX QL< TF\ LP. 11 IQ, QP, PI LT, TQ. IT, QQ. and PQ, 

Ti^s Invantors determine^!, that the polypeptides of the present Invention were produced 
irf even greater yisSds when they wefe expressed as mature proteases fused to a heterologous 

,26 pro-region, and/or fused to a heterologous secretion sigr^ai peptide, as shown In the exampies 
below 

Accordingiy, a preferrad emhodiment reletes to the polypeptide apcprdlng to the first 
asp>2ct which wtion expressed and before maturation comprises a heterologods secretion sf^nai- 
peptide which Is deaved: from the pot^peptide wlien tne polypsptkie is seoreted, preferably the 
30 hMerologoys ^e«'^#tion signal peptlda Is dm'<sd from a heteroiogous protease, pf^erably the 
haleroiogous secraiion signal peptide comprises an asY^jho acid sequancs having a sequsr^ce 
jdenljty of at seael 70%, or pr eferahiy 7S%, 80%, 8S%, Sa%, 87%, 88%< 89%. 90%, 81%, 82%, 
iS%i 94%, 80%j 80%, BB%, m 09%, ssltii the amino add sej^uence encoded hy 
^olyniMdeotidee 1 - 81 of SgQ © NO; 2, or SSei fO NO; 44 
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Accstxilrsglys a prefsfreci «mfeQ<iim$iit r^atss to the psiypsptkie according to the first 
aspect which whpn esqsressed and b0fore matui^te fiornprlses a tet^mlogoos pro-mgion Ircssp 
a protease; prefefabfy the prcKegbr? Is cferived from a« or Sie prot^s^, mom prefambly 
ths pro-region Is ©ncoifed l>y a polynue!eQtf# wh'jch ;hy!?rldfe low, riRedte- 

$ tew, medte, m©dfum--high, high, or vary hIgH stnngerifC^ isorsdiMons \s?lth a polynucfeotidfe 
esneodSnglhe pro-rsgion shown 'm posltSpr? to ~1 of SEQ ID HQ: 28, In possMon 1-1i66 sf SECJ 
iP Hp; 30, in posftiert -167 td>-1 of ^EQ ID NO; 33> in posl^ors ~16S |o -1 of SEQ ID N©: 37, fti 
ppsitson "^115 to >>1 of SEO ID NO: 41, In pjsltion to -1 «f SEQ ID NO; 43, m poMm^WB to 
~i mSEQ ID 1^0; 45, m ppsitfon pf^EO !P N0: 4a, 1?) m$m 1-1M ef SEQ ID HO: 47, In 

10 poslte of ieo is NO: 48, m position 1-166 of SSQ Ip NO; 49, In position i-168 of SEO 
ID NO: SS, ill pp^Rfen l-I^^ M ^EQ \D NO: S1, In position 1-166 cf SKQ iO NO: Sg, or in 
^osioh 1-161 el BEO ID 1^0; 03; and most prefefBtsiy it is ati^i!^ 7Q% sderstlcal, or pmMralily 
75%, 80%, 85%. m%i 67%. 89%, 90%, 91%. 92%, # 94%, 9S%, 97%, 98%, or 
99%, Identical to the pro-regi0n shown m position -ISS td -1 of SEO IP M0:^8, in posftiqn 1-166 

IS of SEO iO NO: 30, :fe position -167 to -^1 of SEQ ID NO: 33, In position -166 to -1 of SEQ ID NO: 
37, m position! ~1S5 to -1 of SEO ID NO; 41. in position "165 to »1 of SEQ ID iMO: 43, in position - 

165 to ~1 of S£xa ID NO; 4S, in position 1-1SS of SEQ ID NO; 46, in position 1-166 of SEQ ID 
NO; 47, in positioJ> 1-i6S of SEO ID HO: 48, in position 1.^-166 of SEQ ID MO: 49, in position 1-^ 

166 of SEO ID NO: 50, in position 1-18# of SEQ m NG; 51, in position :|-180 of SEO lO NO: 52. 
20 or in petition 1~1S® of SEQ ID NO: 53. 

Mucjeio Acid Sea ugnces 

Jm prssent invention also reistes fo isOiated nucleio acid sequences fnat encode a 
polypeptide of the present snvention, Paftic-u^sr nudeio add sequences of the invention are tine 

25 poiynucteotldes o? SEQ e NO: 1 , SEQ !0 NO: 2. SEQ ID MO; 25. SEQ ID NO: 31 , SEQ iP NO; 
32, SEQ ID NO; 36. SEQ ID MO: 40, or SEQ ID MO: 44. Another particular nucleic add 
$ec|uence of tii@ invention is the ssqusnce, preferably the mature polypeptide encoding rsglon 
thereof, wiiich is obtainable from genditjfc DMA ftotn /feoarseiapsfe daBsm)/SM BuMp&cms 
d&$$otmiM Q^h'^ 43235, Tiie present invention also eneompa^sas nueiefe sold s®^uer?aa§ 

30 whioh snoode a poiypeptide hsvfr>g th^ amind aci<l ssqu^nc^ of arhlno aad$ shown in positions 
i to 1i^, or positions -185 to 188, of SE(2 lb HO: 43, «/hich n«dsic add secjueneas differ from 
tiie ojrrespofiding parts of SEQ |P NO; 1 fsy virtti^ of tiie dfgsrseraoy of ttie genetic oode. Th© 
present invdfjtion also relates to 3ut>ss£ju@nces of of the afcov® poiyfiudteotide-s wUsb mmd& 
poiypeptid® fragnierite SiM have pr ptedse activity, 

IS 



A subsequsnoe of a polyrmoi^o^dSi is a nrnMo asld sequence from %lifeh «n^: Of mora 
nucfe0tides ft^orn the S* and/or 3' ©nd has beBn d^Nbd, Ffeferabiy, ^ sybse(|U@nce cpnisins at 
l^st 22S nifd^llsles, more preferably ailsast 300 nticleoticles. evsr? more preferably at lea$t 
37^, 4S0, SOO, 531, 600, 700, 800, ^ or 1000 JiydetjMss, The pressrit invenfiQsi afeo retet8§ 
S t0 rsudec^ld^ sequ©he^s whicsh have 0 <l0greie of idteiltfiy iot^e poiynuc^ !D HO: 1, 

SEQ !B NO; g> NO; 2S, iES ID NO: 31, SEO ID NO; 32, SEO ID HO: 36, SEO ID HO: 

m, Of SED iD NO; 44, of at jeast 86%, 86, 57, 88. 8S, SO, 81, 82, 93. 84. SS, or at 

least 99%> 

The teQhttlques u^ed tp fsolsfe or cioiife a rsodeic adid ssiqysfiqs eneoding a poiyi?apfS;ids 

10 ars Mmm ir? the art arid inblud® isolatiors ft^m gartomic DMA, preparation from 0HA, or a 
c^rj^bioaUon thereof. The donlqg of *e nucfele aeid seqaant^s of tl0 preseot myeiltjon fos-n 
such gehonild DHk om he ^ffedtiedj ^,0,, by using the well kfiown polymerase shasn reaction 
(PbR| or antibody scmanltig of expression llbr^Hes to dMsoi oloned DMA fragrnents with sJ^ared 
struetufal features, Sde, e.g., Ifinis a/,, 1000, POR: A BusdB fo MatbOids and Applm$tim, 

ts Aoademic Prsss, ^lew York, Other nucleic; acid empiifetio?i prdfiodures ^uch as ligas^ chain 
reaotidn (ICR), ligatod Activated transcripiion (lAt) and nucieie add s^ciuance'-basoGl 
arriplifieatton (HASBA) may be used. The mdmo add saquerio^ msy t?® olon#d fem a strain of 
NocBidiopsss Of another or related orgarasm end tbua^ forexampSe/rnay be an aStelSc or speaes 
variant of the polype|>tfde encoding region of the nucleic acid sequence, 

20 The term Isoiatsd nucleic add sequence" as used herein refers la a nucleic acid 

sequence which is essentiaiiy free of other nucleic acid sequences, e.g., et icast obout 20% 
pure, preferably at feast about 40% pure, more preferahly least about 60% pure, even moro 
preferably at least about 80% purs, and most preferably at feast about S0% pure as delefrnined 
by agarose electrophcresis. For example, an isolated nucleic acid sequence can be oblalned by 

2S standard clcning procedures used in genetic engineefing to relocate the nucleic acid sequence 
from its natural tocation to a different site where it wiil be rep^-educed. The cloning procedures 
may invoive excision and isolation of a desired nucleic add fragment cor^'sprislng the nucleic acid 
sequence encoding the polypeptide, io®a?tion of li>e fragment into a vector moleculis, and 
incorpcration of the recombmar?t vector Into a host eaU i^tere myltiple eopi^ or dones of tbe 

30 nucleic acid aequehc^ v^'ill fee replicated. The puoi^tc f csd sequerjc^ may be of genomic, cDNA, 
RNA, semisynti-setic, synthetic orign, or any comfoinatf^s ihiereot 

Modificalion of a riudeic add seqoenoe encoding a poiypaptlde of tiie present invention 
may be necessary for the syrstheais of pdiypaptides Substanfialiy sirnsiar to tbe poiypepide, tHe 
term "substantlaliy Simiar" to th& polypsptide r«fars to ndh-naturaliy ocotirring forms of the 



polypeptide. These polypeptyss may dfWer In -pcmt mgimBrM way from tfee po^p^dsi 
isoM©d from its native soufce* e.g , varlmts that difer In specific as^ivl^', tsermostsMi^s pli 
optiffiMfti, ayergeiiicRy, or tte Mb, Th© variant eeqyenc® eishstructsi^ on tha feasts of til6 

nuc^fete acid saqusocs pntsaited as fee |K)iype|jtki6^^m^ part of the poipycleotld^ of ths 

5 Inveniorj, ^.g, a sybs©qsisnce tli^eof, and/or hy intFbduction of nadeotide ssybs^itutibnii whsoh 
do not five rise to anotl^er ami no^Gld ssiqyenca of ths polypeptide encoded fey the Ruoieiic scid 
sequeocs, but which mnmpm4 to the codon usage of tl^e host ofganism inlende<| for 
production of the protean, or Ssy Introdaction of nudeotide suisstltutions which npy give rise to a 
dllfenent amino aeid sequence. Fpr ^ general dg^erlistisn of nudeotld^ substife-ftion, s#^^ o,g. , 

10 Ford et BL, 19S1> PwiBia Ejsp/Bss^oft am^ Piif?SS?afe^ Lovsf-altergsnlc polypeptides can 

e,g> foe prepared as dsse^ibed above, 

It wII! b#: appamnt t«s thosie skiiled in the art that such sybstttt^iorss can be made dytside 
tiie regions critical to the functlori of the moieeuie and stii! r^ult ih an a<$l\?e poiypeptide, Amino 
atsd residues essehtial to the ability of the polypepde encoded by ths Ssoiatedf nucleic acid 

15 sequence of the Invervtiien , and ther^ore pretwahly not sufojeot to sufo^tltuion, may b& ideriiped 
according to prdooiures knoiyn in the art, sijeli as site-direded mutagenesis or aianine-scartning 
mutagenesis (see, @,g., Cursr^ingham and Weli^i 19S9, Sc?enc?e 244: 1D81-108S), In the latter 
teohnique, frsutatfons ere irttrodticed at ev^ry posill^^^ chargsd residue in tiie nioieooiei and the 
fesuitant mutant oioiecules are t^ted ^ protease activity to identify amino add residues that 

20 am cdtical to the activity of the moiecule. Sites of substrate-protease interadion can also he 
deterniined by anaiysis of the thres-dimensional structure as determined by such techniauas as 
nucfear magnetic resonance analysis, crystaiiography or photoaf^inlty laiselling (see, de Vos 
el a/.. 1992, Science 255; 3Q8-3'\Z Smith &LqL, imi, Jmrml oiMolecular BMogy 224: 898- 
804; Wlodaver eJa/.,: tp2, FE8S Letters 3Q$; S8-64):. 
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IM pmmni invention also mlaiss :|b isoiated mudeic add ssqiMn<as eheodlng a 
polypeptide of the present invention, which hylsn<ife© imider very low Mringency c5sodi1ior\s, 
preferably' tow stilngency soodtiSons, mor© preferafoiy medissm stnngenoy cerjditidfjs, mom 
pfeferabl^f medium-high sthngenoy sondltfons, even more |»eifefabi^^ Ngh stringency cenditfons, 
S ars^ rnost pfsfer^fely very Mgh Vaith a tiijscl^'c aisid prob^ wHifeh hy&rfdizes 

arbiter fhs same conditions ivitb tJm nucleb add secjuence of th^ tnverife or Its cpnipfementary 
strand;: Of sfbiic vanarst^ and subsequeinee^ U^i^rstof (Siambfook IMS, supm}, m40m4 
hereiii. 

The pfSi^nt InveMcm ^Iso r©i;ate$ to isdE^ted fiuclefo gteld seguence^s jsroduc^ed by (a) 

10 Hybridizing a QUA under ve$y tew, Sow, medfum, medium-Ngh, high, or: ve?y high strlng^tioy 
eondltujrss wit! (i) a poiyniicleotide pf the Inventlorii. (is) a subsequencje of 0), s?r (lil) a 
coiirtpi^rnsntafy strand of (f)* or (ii); and (fo) Isoiating the nucleic acid :se(|yer^c^. The 
subsequence is pmferabiy a sequence at isast 100 nuciedtldes sucfe as a s^udhc^ that 
snfeodes a ijbiypejjtide fragment whlsii itas proteose activity. 

1 $ the Introduction pi a mutaliqn into the nudele acid sequence to exchange one r^uclectide 

fer another nycleotfd® rr^ay be ascompiished by site-dii^eoted niutageii^sss using any of the 
iTi^fhods kiiown In tf>e art, Par^'cular^y usefiJi Is the- procedure that uillzes a supersolled, double 
strandsd DMA v^tor with an insert of Interest and two syiiithetic primer^ csjntsinii>g the desired 
mutatldn. The oligonool^otide primers, each compi^smentar;,' to opposite strands of ti"?e vector, 

20 e)itend diirlng temperature sycHng by means of Pfu DNA polymerase. On ii-scorporatisn of the 
primes, a mutated plasmid contsining staggered nicks is generated. Following temperature 
cycling, the product is treated with Dpn\ which is specific for methylated and heml methylated 
DNA to digest the parents; DNA template snd to seied for mutatlon-centaining synthesized 
DNA. Other procedures .kr?owrs irs the art may also be mad. The invention aisci relates to an 

25 Isolated polynucleotide encoding a poiypeptlde as defined In the «rst aspect, 

Nudeic Add Constructs 

The present invention also relates to nucleic acid constructs ocrtiprising a nucielc sold 
sequence of the present Invention operabty litiked to one or more dontro! $es?iyencea that direc^^ 
so the expi'ession el liie coding sequehpe in a sultabie holt c0ji under eondttlons porppatitlie with 
the coritml ssquences. E?cpnsssidn wlli Se understded to include any step Involved in the 
production of the polypeptide Including, tut not ilmited to, trsjiscrlption, post-transoiptionai 
i'nodfeatlon, tJ^rtslatlohs post-i>'anslational modification, arid seciF«tion> 
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"Nyda'c add consirucf' is d©ihed hsmsn as a nuoMo acsd motecute, sther singte- or 
douMe-slrand©4, whfc^ is isolated lirsjrE? a naium%^o gens or which has bfen mc>^^ to 

contain s^merits of nucleio aold oomb-lned sjid Ju^ap^ m a mMnsftiiatwouSd not otherwise 
exist in nafor^/Tlie t^rm Tiuclelc asi^^ wstruotls synor^fnous with the term sxprsssion cassette 

S when tfi© hudeiq acid eonstruct oonMns aii fie eorstrpl separices r^oired for eiKpresssoii sf a 
ees^lng s^cfu^sas of ths pj-ssgsnt IrwentJon, The term *coding se£|uen<se" i$ sMnet;? horaln a$ a 
nuofefc aeld sesiManc^i tfeat dimdiy sjijesi^^^^ sequence of i^ pjpteih prpcfuc^ Th© 

boundaries of tj^ mdlng m^umfm w<s^ gmm^i^ detennlned by a dbosome foinsling sit© 
Cprokaryotes) or the ATO start caden {euk^iryoles) loc^teid Just upstrsto of tie o^s^ reading 

10 frame at the S end of the mRNA m& a tr^nsaiption tarminator sequen*^ focatsd Just 
downstream pf the op^ readlhg fmme the 3' of the mR^A. A ebdiiig s^^iwehce can 
Inciud^, but is not limited to, DNA, cDNA, 5tnd mcasmbinant nuoieic acid secjuences. 

An isolated nudete Bcid se<j ^nOodirig i pojypesptide of the preseni invention maj? 
ise rnanlpu(at<ed In a vaii^ty of ways provide fbr expfesaton of the polypeptide, ManipuiaSon of 

IS tl^ nucieis aeld sequence prior to Its insertion iritd a vector itiay deslrabia or necessary 
d©p©neiir5g on the expfes^ion vector, Tiie techniq^.ies for modi^ihg huelete add sequences 
utiNng reeombinant DNA mdhods are wali known in the asi. 

The term "control sequencers' is defined herein to inciude ail componsnts that ars- 
nacessary or advantageDus for the expression of a polypeptide of the present Inventsort, Each 

M control sequence may be native or foreign to the nucleic acid sequence encoding the 
polypeptide. Such control sequences include, but are not limited to, a isader, polyadenylation 
sequence, propeptide sequence, promoter, signs! peptide sequence, and trarjscnption 
tsrminatQr. At a niinimuui, the control sequences inciude a promoter, and transcriptlcnal and 
trsnslstlonai step siguais. The control sequences may be provided with linkers for the purpcs© of 

25 introducing specltiu restriction sites faoiiitating legation af the control sequences with the coding 
region of the nucleic; acid sequencs encoding a polypeptide. The term "operably linked*' is 
defined herein as s configuration in which a ccfitroi sequence is appnspriately placed at a 
position relative to the coding sequence of fce DNA sequersce such that the control seciuence 
directs the expression of a poiypeptide, 

30 Ti^e contFof sequence may he art appropfiate prohtoter ^uience, a nuoleio acM 

se«|uehce fiat Is reieognised by a host cs^i for expresi^n of the nucieie acid sequescei. The 
promoter sequence oor?taln^ traPs«iptionai control sequences ihkt mediate the expression of 
the polypeptide. The promoter may b& any nucleic add seq«en<^ whldii shows tranacnptional 
as?ti)«% fn the host cell of choios incliidt mutant, truncated, and hyhrld pfon^otefsi, and rfiay lie 



obtamed from genes ^ncodjnp ©xtmc#Msr or irt^miMar p0i>pept!des Mtimf homologous of 
beterotogous I o tlie host oeil 

Examiples of syitabie promcters fof dlmeling tfm tmmcfip^Qn of ih& nudsjc add 
oonstructs of the presarit ioyente; espeoaliy m a feaotens^ ha^ od!, sre th^^ promoters obtained 
S imm the E. q&H W opemn, B&Bpt(myo0 omiipdor gm^ {4^^% Bmillm mMMis 

Jevaosucrase (mcB), Saafte liahen^mfs alphs-amylase g&m {amyljf» Bmflm 

amy!as« §ene (amyQ), Saef//j^s §0^j©?j/fe??j?fs BaGilim mMB >i0im4 

xyiB gen^, arsd prokafyptfo b$ta^iactam^§0 g^gne (V5lia-il^mai*50ff «t 1978j Pfssa^lngs of 

10 tje lslagorjar Academy of Sciences USA TS: 372?-3731)i, as well as the tac prdmotsr (D^Bosr et 
ai, 198S, Proceadings of fh^ Nationat Academy of Sdeiiio^s USA SO; Ei-2S). Fuftier pFOtf oters 
are described In "U^^ful protisins from recombinant bacteria" in $^deriti^cAh1ericant,1#0, 242: 
74-04; and In JSambrook et ai./tSSSj mp^a, 

Examples of sifMabfe isromoters for direc^iig the transcnption of the rsucteic acid 

i& construes the pressjit iiivention in a fllamsntosis ftjngai host Cieil are promolefs obtain^ trom 
the ganes i&t AspBtgllim oiy^aa TAf<A: amylase, RhizomtHx>r mi&h&i aspastio protefnase, 
Asp&tQllim mg&r neutfal: aipha-amylase, Aspeiyflfus r?fgsr acid stable aipfta-amySase, /Ispefgfe' 
mg&rm'Asp&rgilfus awamo;?' giucoamyJase {g/aiA)j Rhimtnucdrms^tmilipam, Aspetgiiim wyzae 
alkailne pmtsasa, Asjmrgiiius oryzm iriose phospiiate isomerass, AspetgHhJs hsdakms 

20 acstamidase, ar'.d Pasadmn ox><apm/m trypsin-itke protease (WO 96/00787), as wsll as the 
IMA2~t.p5 promoter (a hybrid of tfm promoters from the genes for ,4spe«?f&.s; ,,7/ger neutral aipha- 
amylase md Aspergillm o^yias triose phosphate isomoi-ase), and mutant, tr«neatedj and hybrid 
promoters thereof, 

hi 3 yefitst tiost, useful promoters are obisinsd from the gehes for Sscvhammyces 
2$ cerevisiae enoiass Savchmomyces c&m/isi&e galactoKinase (GAI1), Sacchammyces 

mmvlsiaa alcohol deny^trogsnase/giycersfdehyrfe-S-phcsphate dehydroger^aso CADH2/GAP), 
and Sac?c/?ammyc»s cer??i^/sfae S-phosphoglycsrate kinase. Other useful promoters for yeast 
host eels are descnbed by Romanes &t aL, 1992, Yeasf S; 423-488, 

The control sequence may also 'm a sultal^t transcrlpSofi tesmintator sequence, a 
3Q sequence recognized by a hd^t Ceil to femiinate trianscnptlpn. T!ie terniiiiator sequence 1s 
operabiy linked tc ths 3* tesnninus of the nilcy c acid sequen?^ encoding the polypepllde. Any 
terminator which is functiopai In the host eelf of choide may be Used m iiie present ihvefitidn. 

Pref^h'ed terminators ior fliamentotjs fungal host cells air® obterned from the genes for 
Asp&rgiBm ofym& JAKA amylase, A^rgSlkis nfger giucoamyiase, Aspefgiilus niduims 

20 



anthmnliafe synthase, wigsr aipha-gSoccisidase, and FusBsiam oxysporum tr>'psfr5-iske 

protease* 

Preferred tferminstorf % hmt cs!is are obtained fmm ths psn^s for 

Samhammym^ c&myisim mc^sm, Sac^hammyms cm&ysslm cytischrome 0 and 

t^mhatos^ fe yeast host ceils are descnbed by Rpmanos ata/., 1992, sapm. 

Praf0rm<j tarifiilnatars for feacterl^i lidst ceSs, sush sas a Bm(k^ hoM ss®)!, am tH# 
t^mimfem from &Bd0m Sot^nffomiB aipha-atr^lase gsn© (amyl), the Bacfihs 
&i&&mth@rrmi}hiim m^tisQ&ff^^ or the Badtlus ^myldi^a4^Ba$ alpha- 

10 am^ase gehe (amyQ). 

the cqntmE sequance may also be a suit^Me teaii^r s©quehc©i a nonHansiia^ mgm of 
ah ftiRNA Whfet? s$ important: for tfansiatsGn by ih& host esiL Tiie badar saquence is opsrabiy 
TinHsd to Ihe 5* termifsus of the rtyeie^c sdd shushes eiicddini^ the pdiypeptide. Ariy leader 
^gsqussnce thatls functional ih thi© hcst cell of choice: may fete used in thfe present In^ehtSdn. 
15 PrsfbfTed ieaders for fiiarngsntpus fungal iiost mih are obtalried ftpm the genes for 

Asp&0!!mQfyz8& TAKA amyiaisgs and Msp^/lf^s /f/cfwisfjs tfiosfe phosphate fsornerase, 

Suitable leaders for yeast host cei!^ af^ ©Mained from the genes for Sacc^jaromyaSiS 
Gemvisiae enoias® pNOI), SacohmimymB cerevmiaa 3-phosphoglycerate kinass, 
SBGcMromyces cerevislae alpha-factor, and Saa7,*7an:?m>'f;»i? a&myma^ aicftl^ol 
20 dehydrogehase/g!yc.©rald©hyde--3-phc>sphats danydroganase {AD H2/GAP) . 

Ths control sequence may also he a polyad-snylatson sequence, a sequence opefably 
linked to the 3' terminus of the nudeic add sequence and whsch, when transcribed, Is 
recogrslsred by the host call as a signal to add poiyadenosine residues te transcribed mRNA, Any 
polyadenySation sequsrsce which is fuocfcnal in the host cMl of choice may Pe used in the 
2S pfssent ioverstion. 

Preferred poiyadenyiation sequences for filamentous fungal host cslis are obtained from 
the genes for ,4sp0.fg#j,(5 or>2ae TAKA amylase. 4sperg*«/ys nsg&r glucoaroyiase, ,4spo/y///ys 
nlduiam anthranliate syiithase, Fmmum OHyspowm trypsiri-llks proteases and >4>sper§i;&s mgw 
alpha-giucosldase. 

SO ys#ul polyadenylatft^ seqiieiices for y^ast host eeli^ are described tsy <3uo and 

Shernian> li&S, M&^muMr CBfiuiar&mlogy liS; S983-59SfQ< 

The cPntf'ol ipquenca may also be a signal peptide codl?ig region tiiat codes for an 
ai^iioo aeld sagusnce teked lb the amino tsrmiiius of a polypeptide and directs the eheoded 
poiypepMe ihto the csfs eeprefory pathway, "The S' m4 of th© cbdlrsg seqdehce of tlita nudeic 

21 



acid sequenss may infi^^sntly csntairi a slgna! coding region naturaiiy feked Ir^ 

translation reading frame y\Mh fje segrnent of Ife© codSrig region wH'ich enccsctes fie secrstf d 
pctv'peptide, Merrjativeiy, th^ S* end of tie m^mg mqnmcB may contain a ssgrjai fsepMe 
coding ragiors which Is fomip to the coding sequenca Tft« fmmm sigrssl peptsds oedirug T«f son 

S m^y be reqyirsd vs^ere %& coding s;equ<^e# does n^t natyr^ily contsin a signsl peplsde coding 
region, AftematEveiy^ th^ foreign signal peptide coding region may simply replace the natuml 
signal peptide coding region Jft 0rd# to^hsnce secre^<>n of poiyiseptid^, Ho#ever, miy 
sigrsal peptide coding region vs^lch direcIS: the axpmssiKi poij^^ptide inic secretory pathway 
of a host ceii of choice may be ffs the pre$snt1n\feft8on- 

10 EffmMm sighai peptide coding regions for ba<*enai host s^lis are ths slpsi peptide 

cpdlr>g regions sbtalned from the genes for Baalim NCiB maltogehle amyiase, Bm0tis 
stmmthemophitas aipha-amj^ase, Badifus ihheniBrmis sybJiiisin, Ba&llm ^htiensfomis Mpha* 
amylase, Baaffm s?eamS^efmopM«s heutral proteases (npfT, npfS, npH^), and Ba0iim siMBs 
prsA, Fuflhef" signal pep^des are dssc^bed hy Simonen and PalvSj 19^3, yibnsbiologlcai 

Effective signal p^lkl® m^m§ regions for fflamerSoi^ fUngai host ceils are the slgmi 
pep!i€i© coding regions obtained from the genos for Asp&aWkJS iQry£&0 TAKA amyias@i 
A$peiylilm nigBt neutral amyias©, Asp&rgSim niger giuooarnyiase, Rhizamucar mfel?e/ aspartic 
pa-steinase, Homicc^a lmahnB oe!|yi8$8> and Humicola iamgkmsa lipase- 

20 IheM signal peptides for yeaist ho$t ceils are obtained from the genes for 

SasoMmmyGss cBievlsiae aipha-taclor and SaccJyaromyc&s cerevism& Invertase, Oth^r u$efei 
signaii peptide codir?g regions are described by Romanos etai, 1992, syp.®. 

The controi sequence may aisc be a propeptide coding rBglon that codes for an amino 
acid sequence positioned at the afnin-s lerniirvus of a polypeptide. The resultant polypeptide Is 

2S known as a proenzyme or propolypsptide (or a zymogen in some cases), A propoiypepticle Is 
generaiiy inactive and can be converted to a mature active polypeptide by catalytic or 
autocataiytic cieavage of the propeptide frorr-s the pFopclypeptide, The propeptida coding region 
may be obtained from the genes for 5'e<^'«« 3adt.«/vS ailcafine protease {sp,r£), Bamikm sabtilis 
neutral protease (nprT), SBCctiammyo&s cerevlsl^se alpha^factor, 

30 proteinase, and Mymihph§iof&ti3^0phlte lacsjase ^0 p/33?36). 

Wh^e botii signal psp€de ind pfop8|>tide rsgjons Me pmsent at the amrno ternilnus of a 
polypeptide, the propeptide region is posffie^ied n^ fe the arnino t^lnys of a polypeptide and 
ttie signal peptide region is positioned ne)d to the amino terminus of the propepid^ reglors. 
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II may also b# <fesifsfe!e lej> a«M regulatory sequerKJ^ whicli aOow tJie regulation of ths 
ex|?^$foft pi th® ppiypeptide refaiive to the gmwU) of tfe host ceii. Examples of r«gu!alory 
systems am those which cause the esipr^^^ of the gane t© be turned on oroff in respome to 
a olii^lcsS Of physical stimuitis, mciuding the presence of ^ r&suMor^ mmpmnd^ iRsgylstory 
S systems In prokaryoSc systenis fhdude the: /ac, Isc, and tfp Pp^t&r f^-^^imm. in jem% thfe 
J&it system ©r CsALl sy^^tem may be useci> \n Mm^nipm imgh the TA?^ aiphaf-amylas© 
pforootsr, A$00!im rslg&r gitjcoafnylase prpmot^^r, and Aspergl^iu^ i>fy:sm 0mmrns^BB^ 
pmmoiumBfM m^ m reguiatoty sequences. Ofeer examples of r^gyatory sequsi-sses am 
those yifhidh allow gerie: amlJilfidate, it\ euH^otid systems, thesii irjdudg the dih^rpfoiate 
10 r^ustase gene which is amplified in the presence of msQioirexate, ar>d th@ mstalbthjofidn 
gertes which are amplified with heavy metais, in feese cases, the huctetc add saciua^nce 
encsddiffg the polypepticfes would fee ojwabiy linked wllh the i^gulatory sequence. 

Expr^slon Vectors 

If Th4 presfeht Invention also relates to reoomblnaftt Mpms^im yectdrs comprising a 

nuclsie acid sequence of presadt Invsn^lon, a promoter, artd traBscripfionai and translatlonal 
stop slgnt^ls. Th© various nuclslc acid and control sequences dsscHted abcve may be joined 
together lo produce a r^mbinant expression vector which may include one or mere Gohvanient 
r&strfcticn sfes to allow for insertion or substltytlon of the nudeic add sequence encodino the 

£0 polypeptide at such stes. Alternatsvs'y, tha nuciek; &cki sequence of the present Inyentlon may 
be expressed by jnsertsr,g the nucleic acid sequence or a nucleic acid construct comprising the 
sequence into an appropriate vector tor expression. In creating the expression vector, the coding 
sequence is located in the vector so that the coding sequence Is of?8rably linked vvlth the 
appropriate cont'ol sequences for expression. 

2S The recombinant expression vector may be any vector (e.g., a piasmid or virus) which 

can be conveniently subjected to reccriblnant DMA procedures and can bong about the 
expressiv'sn cf the nuciesc acid sequence. The ciiolce of the vector will typically depend on the 
compalll^liity of the vector with the host cell into vsihioh the vector Is to Mf introduced. The vectors 
may be linear or olpsed dircular plasmids, 

30 Ths vector may be autdnPmousI^ fepllcafing ves^or, /. e, , a sm^or which axlst$ ss an 

extrachroffiosomal eritlty, the fgp!icall<Hi of which is Mepsnd^ent of ohmmcsom^ rspHcatlon, 
e.g., a plaornsd, an sxtrachromosomal elefnept a miriiCHrompsome, or an arllfidai chromosome, 
The vector may contain any means for assuring seff'.repfetlon, Mernsitively, the vector may be 
d?ie vi'hiish, whei? Introduped Into the host celt is Inttegrated shtd the gendme and nspllcated 



togeiha- with the c^romosome(s) into ?«^feh it has feeef! Intsg fatscl. FMrth^mom, s sifjgte v^sotor 
or plasmy Qj- iv?o m rnore vectors or ptasmsds wMch tog^her ojfi^fe t^e total DNA fo foe 
in&'eiduced into ths gesnoms of thies tet cell, or a transposon may b« used. 

The yeslors ei iii^ prssent invsntton ?s^fers|>iy contain pm ss; rTsom seleqIabiB marker^ 
§ wh?ch permit easy sfel&eilon of tran^^rmed c^lls, A sefectafate marka- is a gens th© pmdii^ of 
which provkies for bioelde m viral msislianoe, rasistsnce tci h&my m^als, pmipir^phy h 
auxoirophs. arsd tile ilka E:^ntpfes of bactsna! iectabl^ markers are the dml gen^s fttjfri 
B&dMs mb$k ot BBoillm H^nihm^ Suial5le marl?^ for yeast l!0# coll^^ are AD!E2j HiS3, 
LE02, L¥S2, METS, TRFI, and URA3. Selectable madtsrs for use in a iiam^ntpus fmg^l Ixsst 
to cell includSi but m& not limited to, amdS (i^ataTiidase), argB (ornithine caHasmoyitransfefase), 
bar (phQ$^lmiHtMn Bmi^^mmW^^ hygB (hygrdmyofn phpsphotransferssa), »/aP (nitiiate 
reduetass), pyri^ (orotidine-S'-piiosphate cJecaftiSxjflase), (suitbte adenyltransferas©}, ttpC 
{antilrianilat® s^Miase), k& m0 as e^uivaients t^jerbdf, iPreferr^d for use m m Mp^iyiifu^ i^^ 
sim the amriS ahd gefies of ^spergto nkk^ims m A^BtyiSus Qryza& and the fjar pane of 

The vectors of the present Invention preferably contain an feiemenl^s) that permits stafel© 
Integralion of the vector into the !io^ deil'^ CfShomo or autonomous irepiiGStion of the voptor In 
the ceil Independent of the genofns- 

For integration into the host cell genome, the vector itiay rely on the nucleic acJd 

20 ss?quence encoding the poivpsptide or any otiter elemsn? of the vscior for stable integration of 
the vetstor Into the genoiris by homologous or nonhomafogous ^"ecombination. Aliernallveiy, the 
vector r^nay contain additional nuciaic acid sequences for directing integration by homologous 
recomisination into the genssir^e of the host cell The additionss nucleic aoid sequences enaijie 
the vec&r to b© integrated into the host ceil genome at a precise loc3tion(s) in the 

2S chromosomo(s). To sncreaso the iikesihood of integration at a precise location, the intsgrationsi 
©lemenfs should preferably contain a sufficient number of nucleic aoids, such as 100 to 1,S00 
base pairs, preferably 400 to 1,500 base pairs, and nm-t preferably 800 to 1.500 bass pairs, 
wiiich are highly homologous with the conesponding target sequence to enhance the probabiiity 
of homotegpus recontjinallon. tlie fntegratlonaf element may be any ses^yence that Is 

10 homOiogoys vsflth the target sacfuenp^: ili the genome of the host Fyrthermore, the 
integrationai elements may tse noivencodtpg or ^eodfttg na(^elc acid eeE|uencea On the other 
hand, the vector may be integrated into the genome of the host ceil tsy non-homologous 
msombfhation. 
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f0t ^i^omrmm m^ioMlm, the vess»f rrsa:^ fartfsef conipfise m origin of mpifestlon 
enaMwg tlie vsctor to repffjoate^ytQnomousiy in tf^s host ceii in qyestlcssi. Ex8tmp|e$ of foai^arial 
origins of reppcatiOfi fh® &t\0m af m^cMm of plasmtds p8R322, pUCl9, pACYCIT?, and 
pA&fOiB4 pffmittlrsg mpiisatfon m E cdl, and pUBIId, pEiM, pTA1C|60, and p^UM 

S p^rmltling i'^pilcation In BBmHm- lExart^pf^ s?f oi1§fh$ of rspifesSon fer us® Ifv a ysast host cell 
are the 2 miaon ORgin of mpibatlon, AR;Si , kR$4, th© combinatfen of and md ths 
<^mb|r>atsori of ^??S4 and CEN6, Th© ongsii df rfeplicatipn iii^y b« oneltavlng a mutation wllitiSi 
mate it functioning temp©mtUFe>-Siginsits^^ (see, e^g., Elirlich, 1971, p«>c@@#3pf 

afB&N0pns^Ai:^d0mf0fSaipr}^ 76-: 1433), 

16 yore tiasi one oopy of s suoNio aoid: sssqysnoef of ti^e prei^rst invsrvtion may ipe Inserted 

into tie host celi to increase prociM#on of tlie gene product. Ars lnGr0s$e sirt the dopy iiumlssr of 
ths nuciefc acid sequence san bd oM^inad by infeoratrng at bast one acSdilloJia! copy of ite 
ss^ussRos Into tiie ho^td^l! 0®noma or by fhoiudiiSg an ampHfsaSiis ^el^i^ble hiarket^gen^^ 
the nudeic aeid seqiuencs® where cells oorrtalning ampliliad coplas of the sdactabid marker 

15 gene, and thensby addlljpnai copies of tie nucleic scsd sequande, cart ba selected for Iby 
GuSflvating the c^ls m the presence of the appropriate sslsctable ageni 

The procedyres used te ligafe the etements described above to cor^striid the 
recdrnbirtant expi^sslon vectoi's of the present iov^tsc^ are wa!i known to ona salted in tha ajt 
(sea, e,0„ Sambrook etal,, 1989, s«/?.fs), 

20 Tho protease n^ay also bs CiS -sxpressed together with at least ors-a other s^iUyTne of 

i!>terest for ansmai feed, such as phytase (EC 3,1.3,8 or 3,1,3.26); xylanase (EC 3.2. 
gaSactanasa (EC 3,2.1.89); aipha-galactosidase (EC 3,2,1.22); proiease (EC 3..4,-.~), 
phcspholipasQ A1 (EC 3,1,1.32): pinosphoilpase ,A2 (EC 3/1. 1.4):; ^v'sophospholspasa (EC 
3,1.1,6); piiosphoiipas^ C (2,1,4.3); phosphoiipsse D (EC 3,1.4.4); aniiior beta-giuoanase (EC 

2S, 3.2.1,4 or EC 3.2.16), 

The enzymes may be co-sxpressed from different vectom, from one vector, or using a 
mixture of both iachnicjues. When using different vectors, the vectors may have different 
seiactable markers, and different origins of repiioation. When using oniy one vector, the genes 
can be sxprassed frors^ one or more promotaFis, f clonad under the rsguiatlon of one pfofmotar 

3S {d!~ or mtslii»di$fronfo), tise order in which the gerjes are doped itsay alfeoi th® axprassion levais 
of the proteins. The protease may also he expressed as a fusion protein, la, that the gene 
encoding th© proteaso has bae^ fused in fi^mfe to the gene ensMding another protein, this 
pi"otein may ba another ansyme or a functional domain from another dr^yme. 



Acodrdsngly, the Invention also relates to a mGomi^imrst expmsston vector df 
jx>l*/nuef«oSde construct com^^ a polynucteotide 0f the frwei'vion, 

S Jh^ present inven^n ateo rei^tigs to reporritisftant h«?st csifej eernpnsing a jiUclac aoSsI 

sequence of the inventscm, which are aclvantageoysly usss^ jrs the rssomfoinant producSen of the 
polyp^tidesi A Vector qpmjjrlsins s rtuelf ic #qld^ig<|uano^ o ifeie pre^rst mvention introduced 
IMo a host <5©i| so that tie ysctor b maintained as a ehFomosomal integraht or as a sell^ 
rapjicailiig ©xtra-chromosemal vector m deseiliseci e^riler, The term *'hoet celf enajsiftpasses 

1 0 my progeny of 0 parent cell; that le not identice! to the paraiit cefi dm to m utat!0??e t^M aeour 
dyrlog repHoatlon. The choice <5f ^ host ssefl to a Urgs extervt d^pand upon the garsa er^coding 
poiypeptsda atid Its sourc©. The host se!i may be a isr?ic^!yar micmorgahism. B.g., a 
prokaryote, ora rjofirumceSiylgsr mlcrosjrgarjismj ^,g,,k eukaryote, 

Useful tihicsHiiter <se]!s are bsstesia! eelisi suoh as gram posstiva baclana Srtduding, but 

IS not Jimled to, a BaciBiM ceil, or a S^mpt^Mstiya&s ceS, or tsdh of i^cUc acid bactsHa? or gram 
oegallv© IsacteHa such E a>// and Pseysbiwonas sp, Latstb aoid baetsria include, but are not 
limited to, speciea of the genera LqcUxxk^cl's, LactoMdHus, Leaamosios, S^Bpitx^MS, 
P&diocoa&m anii EntmKiaocouB. Ussfui unicellular ceils are baderiai ce!!e eudh as grarrs positive 
fescteris including, but not limited to, a BaoiBm ceil, &,g., Bsdffos eilkahphiius, BbciHus 

2S amyioiiqaefacmns, Sacilhrn hmvis, 8aciHm cmukms, Bm:il!m clausIL Badlivs cofigafaas, 
BbcWus hsutus, Bacflhs hntus. Ssclilus iichenikfrmss, BacHms nmgaf&riurr), BBCilius 
stearoihemiophilus, BadHuB subtiliS: and SaciiVi/s tharlngierisis: or a Sf/epj'am,K::es ceii, 
Stmpiomyoss IMiians or Streptomyces murinm. or gm'n negative bacteris sucti as E coli and 
Ps&udonioaas sp. in a pr6?ferreci erobodsment, t!"5e bacterial host cell is a SacM/s fentm, 

2S Bacifkss iich&nsformm, Badim Sieamthermophlkm ot BmMm mMiis ceil, in anotiier preferred 
embodiment, the Sadi^ys coil is an aikaiophiite SaGfii(;$, 

The introduction of a vector into a bacteriai host eeii may, fbf instance, be elfeded lay 
protoplast transforrration (s^, e.g,, Giiang and Cohen< 1979, Mdocakir Gen&ml Q&iwtscs 168; 
111-115), ut!«g compeksf^t ceile ($ee» e.g., Young ar^d Spizizin, IB&i. jQum&lof &aGtBrlQfogy 

m 81; 823~02§i or Dufehau ami Davidolf^AfeeifeOfiii 1^71, jQiim^l af MslBQakrBlQhgy 56: WB-^i}, 
©lectroporatbn (see, e.g., Shigekawe artd Sower, 1989* BhtBohntqmB 6; 74Z-^7^% or 
conjugation (sas, e.g., K:o#[fef afid thorne, 1987, Jmmal Bactenotogy 169; S77t«5278), Tiie 
host dali nisy tee a eukaryste< audi as a ndn-^hurrian anirnal cell, an insect cell, a plant cail, or a 
fungal cell In or?® paitimisr emhodfrnent, tha koM e&li is a furjga! cell, Tungf is used hamin 
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includes the phyla Ascomycota. BaS'frfiomycotaj Oliytndsdftiycota, and Zygomycola {as slsfrnsd 
by Navs^MJEth &t ai. , h\ Ammor0 8/?cf Bmbf^ DMhmfy of lTm fmigh Mh BMm, iSiSS, CAB 
Meropfioflai, UnfVer5% Press. Gam&nslse, UK) as well a$ M ©of^pols (as eltesl In 
Hmkmofib &i al., i^S, supra., page 171} and aii mstospofie fungi (Hawkswort^ %*a^^ 1906, 

in anofeer fsartleutar lemfejcxlimsr^, the fungal Ho%t csli Is a yea^t t^l!."Y8a$f ^ used 
hsresin inclitdes aacQSiprogsnpus a^t (Erjdf?mycsiafes), bai^<;lfosp0rogenc^^^ ypaM, asd y^aM 
beionglng io the Fungi im|5effe«^l (Biastoifnyoefes^ Stns:^ tJ^e dagsl^catlon of y^si may cte>g® 
io iis futMre, fortse jSyrppses csf this ifiverlison, ye^^ be dfefetj di^crlb^ In Bfdtogy 

10 0nsi ActhfittBs of YmM {Skmnet, f >A„ Passmore, S>M-, and Davenport, ecSs, Soa App. 
Sacfe??pl, Sympo^umSmm Uo. B, 1880), 

Th<e V^ast host cell may bs a CBa^ld^ HammuiSt, l<iiiyv&rcmy<s&s, Pichla, 
$mohmmyces, ^^immmhBmmyaes, or Ya^Hf/a^ell 

Thfe fungal host cell fhay be a filamehtous fungai <^ii, Tliamsntous: fengf include ail 

is fsiamaritous f«^rns of thgf sutKiMslon Eumycota and Oomyco^ {as dafini^ by Hawkswsjrtb # ai, 
1995; sapir^)/ The fsiam^htous togt ai^ ciHai'actensied by a !iiy<^ia! wall soniposed of chltin, 
c^uidse, glycan, chftossn, manna?), and other oompisK polysaccharides^ Vegatailve gmwtf^ is 
fciy hypha! slongalion and earbon cMaboto is obligately aerobic, in contrast vegetstive grswth 
by yeasts suoi) as Ssco/?arDmyc«*s comv/s/ae Is by budding of a unlceilufar thalius and carbon 

20 cstabbiism may i5e fermentative. 

Examples of filemantous l'ui>gal host ceils are Ciglis of species of. but not limited to, 
Aamnomjrn, Asjs&rgiHas, Fasamtrh Humi&dB,Miimf, MyceSbpMJom, Neamspom, PmicBfmm, 
Tfmlam.. Tofypoamimm, or Trthoclemm. 

fungal cells may i>e transforfrjed by a process involvii^g protoplast fosination, 

2§ transformation of the pratoplssts. and .'-sgerisrstion of the ceif wall In a manner known per m. 
Sultabte procedures for transformstion of ^sperg^J/j/s host cells are described ^n EP 238 023 ar^d 
Yolton or a?:, 1984, Proceedings of ihe National Academy of Sd&noes USA 81: 147D-1474. 
Suitable methods for ti-ansforming Fifsa/fi^n? species ara described by Malerdfer or al, 1889, 
Gem 78; 147-156 and WD 96,^00787, Yeast n-iay 136 transformed tfssng ma procedures 

30 described by Bscfe md Gu^r^ta, M Afosisdn.; J.H- and Simon, M J, , editors, Grndk h Yem 
Qemtm md MBoMrBkmgy, liMods is pp 182- 187, todemb 

Press, inc., Naw York; Ito eial., 198S, Jayf^a/ofBaetedosfoa/m and Hinnen otaL, 1978, 
PmGBedmgsWthe Matmna! Aoa(iemy<^^ 75; 1920, 
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The Inveniioo relates to a (■acomSmant host cell cQmprisfng a polynucieatide of tlie 
sfiventid?'?, or an ei^ression A^qr or poipuclsqiide constuci of the Imwiion. In a pret^red 
ertsbs^femnt, the ree0mbi!iaht M^i e^S Is W- Bacsliiisc^ll 

Tne present tevention aiso relati^ to a irsnsgenle pim% pto part, or plant cell v^lch 
has bBBn imm^om&i wkh a nucfeic a*5td ssquersqe sncqding a pp^peptfde hmm pfoimm 
ac^Vilty of the present invention so as to express and pmducs the poiypeptSsie m foo^verab!© 
qMantities. The polypeptide f?tay he repovered torn the pfanf or plant part. AStoma%e!y, the pla?ift 
ID or plant palt cehtamlhi the feoombimnt psjlypeptlde may bs used as spoH for ImprovlJifg the 
quality of a fesd or feed, e.g., irnpra^hg nutiltianai value, n^atabity, and rheol^tqal prspsftlesv 
or to d<&strey an ahSriutritlve factor. 

In a particular embodlhient; the pdv^peptlde is targeted to the er)idosperm storage 
i?acuo!es m seisds. This cah foe obtained by s as a precursor with a sui|ais!e sighai 

is peptide; see horvath^ a! in PNAS, Feb, 15, 2000, vol $T, no, 4, p. Iii14-19i9. 

The trat^gsnfe plant cers he dlcotylsdonous ta dicc^) Pr rhorjocotyledorious (a monacot) 
or engineered variants thefeof- Examples of monjxjol pbnts are grasses, ^suph as meadow grass 
(biye grasSj Poa), forage grass such as festuca, foisurrJ, temperate grass, tmh as Agrosls, and 
eereals, e.g., v/heat, oatSj ryej baHey, rioe,: sorghum, ar5d ft^ake (cprr^), Sxarftples of dlcot plants 

29 args tofeasco, legumes, such as iupihs, potato, sugar beg^^ pea; bean and soybean, arsd 
Gruclforous plants (family Brassicecsae), such as oeulHRower, rape seed: and the closely related 
model organism Ambidopsis thaikma. Low-phytate plants as d>ascnbed e.g, m US patent no, 
S,988iD§4 and US patent no. 6,111168 are examples of sngsneersd plants. 

Examples of plant parts are stem, callus, feaves, root, fruits, seeds, and tubers. Also 
2$ spocffic plant tissues, such as chloropsast: apopiast, mltochcndna, vacuole, peroxisomes, and 
cytoplasm are considered to be a plant part, Funhermore, any plant cell, whatever the tissue 
origin, is considered to be a plant part. 

Also Included within the scope of the prasent InveMlon are the progeny of such plants, 
plant parts and plant c;e!ls, 

30 The Iran&genic plant or pfeM cell expressing a polypeptide of the present invention may 
be cdnstruGled in accordahoe with methods known in the Briefly, the plant or plant cell Is 
constructed t>y Incorporating one or more expression oonstruots encoding a polypeptide of the 
present myentlon into the plant host gendms ahct propagating the resulting modirled plant or 
plaot Itito a trajisgenic plant or plant ceil, 
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Gorwenfenliy, ths sxprnsssosi eonstfuct Is a nudele add cortstifUGt which cdmptrlses ^ 
niKjteic add sequence ^coding a poi>'psptscte of th^ pm;^nt InvMtloo operabfy llnis^ witlhi 
appropdate regulatofy sequences requirf d for i^xpre^sjpn of ihf nydejc seiqiJtwc^ iw tih© 
plant Of jslsn^t past of eNfc^- FMife the e>^msslon oamtruot may consipsise a s©SectaB!e 
5 marker useful for Ideitlfysng host c^lls fn|o whfOh the expression con^rubt has bmfi migrated 
and DMA sequences nscessaiy for btodusllQh of the construct Infe the p\mi in ^im&^on (th& 
!a^fer depends oil tis QH^ mtrddtictiOf? i^^tbMto J?e us#d). 

The oNoioe 9f mguypjy secjuenssesi, suoh as promoter ^nd fermiOBtor se^uejic^$ and 
optlonaily' sfgnsi or transit s^quenoos are deferrrHrt^, far exarnple, on the feasis of when, wheres 

1Q an<i how the polypeptide Is desired to i?e axpressed. For msteoce, the ©iipressSoft of the^ene 
enipodins a poiypeptkie of the present Invention may be cpfus^^tutiye or indiidlirfe, or may l?e 
davelopmentai, stage or tissue specsfsc, and the gene product may be tar;§etes(t to a specffe 
t^^ue or plant part such aa seed Regulatory sequences are, fsr example, descfllsed 
by f ague &i &L, IMS, :f*%s?ofegfy 86: §0S. 

16 For constitutive expression, the 3SS~Cay¥ pwrnoter may be used {f rencic e( aL 1^0, 

C^} 2t, 2B5~294}. Orgah-speiSiia promoters may be, for exampl8i ^ promoter from storage sink 
'^SSM^s^SU^'-a^.'SEMBd^ -p^o tubws, s«^MMs '^«|«^r^:& Con.iad. 1990. Ann, Rw. Genet. 24; 
275-303). or from metsbolfc sink tissues such as merlstems 0to et aL, 1994, -Plant Mot. SM 24: 
863-873), a ssed specific promoter such as the gfutelin, prolamisx globulH: or albumin pnomoter 

20 from rice (Wu ef a/., 199S, P/a.'7f amf Ceti Pt>ysiology 39; 8SS-889}, a 1%^ tuba promoter from 
the legumln B4 and the unknown seed pmtein gene from Vicla faba (Conrad ef aL, 1998. Journal 
of Plant Physioiogy 162: 70S--711), a promoter from a sesd oil body protsin (Chen at aL, 1988, 
Plant and Cell Physiology 39: 93S--941), the storage protein napA promoter from B.fas^fe? rjapws, 
or arsy other seed specific; promoter Km\m In tl>s art, e.g., as descrited In WO 91/14772, 

25 Furthermore, the promoter rnay be a leaf specific proi-noter such as the mcs promoter from rlce^ 
or tomato (Kyosiska a/:, 1993, Pfe/!f P%s/o/og>' 102; 991--1000, the chtoi-ella virus adenine 
methyltransferase gene promoter (Mitrs and Hlggins, 1994, Piani Mdecuisr Bhhgy 26: 85-93), 
or the s.^c/P gene promoter from rice (Kagaya el e/„ 1995, Mo/eci//ar end Qemra! Gemtics 248; 
86g-ST4)j or a ^-oynd Jnduc&le promoter such m the potato pln2 promo^;er .(Xu of al> 1393, 

S9 Bant MoiBGUisrBlofDgy 22; 57^^88), 

A promoter enbanser eiement may a!so be iBed to achieve higher exiiresslors of the 
protease In the plant. For hmkSmsi, -j^-'pori&c^-^haiow: i^wnent may be an intron which Is 
placed befesfeer^ the prortKstsr and the huciedtfde ssquerrce encoding a polypeptide of the 
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pmsmi Immtmn. For Instancs, Xu M 1883, sypm disdase tHe usfeofthsfir^lntron of the 
flee aetin 1 gene to ©rshsiic;® ejcppesslpn. 

Still furthfef, t?e eodors li^gg^ may be optlmigssd for Ih^ plmt spsfiles In qyesiion to 
Improv© expression (see HoEvath el al mferrsd to ai30¥s]i, 
$ Ths s#!©ctsible mafHer g#ne artdl any otiier psrts of the escpr^slon construct may be 

chosen torn thos^ avsiiiaSste h the sit. 

The nucleic scid construct is Incorporated Into the plarit g^om© aceofiiing to 
convanlon^ teehnlqiMS known In the art, Inoluding A^taoterfm^mdim^^ fe^Rsfomiafe, 
virus-mediated: traii$foitn#!pn, miemlnlsctloR, partiple bomfeardmer^, blolMlo l^nsft?miatei 
10 and siectfoporatlon (Qmrn- eiMi So'mm244: 1293; Potfyi<us, Bio/TBchmiogyM 
S3§- Shimamoto er#, N^trnMB: 2^). 

Fi-es^n%, Agmb&dBduw tm^mm-m^M&d gene transfer is the method of eholoe 
for g^Rerstlng fe-gnsgenlc disots 0or ^ re#Ws see Bopyk^ and Sdhllperoort, 1S92, P/a«f 
M&^eoular Bi&dgy 19: l 5-38)< However it csn also be used for transfontiing {tjonocots, aSthoiigh 
IS other iransfomiatlon mettissJs ars genef^ily preferfed for these planis, Fres^sntly, the method of 
choice: for generating transsehfc rnonocots Is psrtide bombardment (irsferescopte gold or 
tuftgsl®R particles coated with the transforming DNA) of er?>bfyon!0 cai oi' developing embryos 
(Chfistou,: 1992, Plant Journal 2: 275-281; SHimamoto, 1994, C(/??Bm Opj>>/on a?d6?t;?:i«ofo£r)' 5: 
158^162;; Vasi! ®f a£. 1932, Bixx'Teohnofagy 10: 667-674), Ab astomativs method far 
20 transformation of monocots is based on protoplast trafssfommtion m described bf Omirulleh el 
a/., Plant MoIeciJiar Biology 2i: 416-428. 

Foilowfng transformation, tiie transformants having incofposated therein the expression 
construct are seiecied and regenerated: Into wi^oie piants aQcording to methods weli-l^noyifn in 
the art. 

m The present invention also relates to n^ethods for producing e polypeptide of the present 

invention comprising (a) ouitiva^ng a transgenic piant or a piant cell comprising a nucleic acid 
sequence encoding a polypeptide having protease activity of the present invention under 
conditions conducive for pmduction of the poiypeptide; and (b) recovering the poiypeptida. The 
invention relates to 8 transgenic plaat or plant part, comprising a pofyrvusleotlde as defined in 

3d claim 8, or an expression yebtpr or polynudePtSde conslrijct of t\e invention. 

Animals 

The preseht inVentlori also rslates to a tfansgenic, non-hun^afi anlmai and products or 
elemesTts thereof, exarnples of which sre k)dy iuids such as milk and Umd, organs, Jesli, ahd 

30 



animal asl!s, Tedinseiuss for expressing proteins, e.g. in mammalian cells, are known In the art, 
see e.g. the liSEidbook Prciein Expression; A Pmctfca! Approach, HIgglns and Hames (sds), 
Oxford Uoiv^lty Pf^ s©f!es relstlng to G&m 

Tmmo^0m, RMA procsssirj^, and Posi-tminslstional Pmcassmg. {Saieraliy speaking, io 
5 prepam a transgeiifc animal, ;?eie»tedl c^ils of a s^tected anlmaS ar« tensl<>fH)«d pth a ouctelc 
sckf sequence encodisig a ipdli^peptld® having protease activity of the present myfenllo« m to 
express and >rp?J[uce tl^e polypept|(|e. The polypeptide msjr be r@c<3vei^ fron^i th^ aesmai, ig.gv 
fmm the mslk of femate ar^lmalSi or ihp pol^'p&pMm may be expressed to the benefit of tfie 
anima! ttseif, e,g, to assist the arthial's dlgss^orj, Exgmpies of animate are ma^^ m 

10 tbe sectipti headed AmEni^tl Feed. 

To prodiisa a transgenic animai with a view ip racqverlng the jiH-ptease ffpm tje milk of 
ih& anfmai, e gens encsssding the protease may b« Inserted lht<5 the- fertii^ad eggs of an animal m 
s^a^estion, e^g, by uss of a transgsne «sipresslors vector wbioh comprises a; suitahle protsm 
promoter, and the gene encoding protassa The ^ansg^ne expres^on vector is 

1 s mldrolf^ected j nto fertilised eggs, and prefarabiy pemianently lrifte|fatad mio the chromosome. 
Once the egg begins to gtg^ and divide* ti^^ pcstfer^^al ariiijryp is Ihipfarrt^d Into a $uj^rogate 
mother, and animais carrytng the transgene are identified. The resui^ng ahlmai can then be 
friultiplied fey cohwntlona' breeding. Tti© polypeptide may fo® purified from the anim^rs mffk, §e® 
Meade. HM. et ai (1999); Expression of recombinant proteins in the milk of ImnsgeNc 

20 an imals. Gene expression systems: Using nature for the art 6f ^presston, J, M, Fernandez and 
J. P. Hom&r (eds,), Academic Press. 

In the alternative, in sirde-rto produce a transgenic norvhumar^ animal that carries In ti-se 
genome of its somatic and/or germ ceils a nuoeic acid sequence including a heterologous 
transgene construct including a transgene encoding the protease, ti^e transgene may be 

26 operably linked to a first regulatory sequence for salivary gland speoislc expression of the 
protease, as disclosed in WO 2000064247; 

The invention nslatcs to a tf-ansgenic, non-human animal, or products, or elements 
thereDf, comprising a poiynudeollde, or an expression vector of poiynucleotlde construct of the 
invention. 

30 

l^^otasdMction 

The present Invention also reiates to methods fer producing a polypeptide of the preser^t 
iRvsiiion comprising (a) duitivatlng a host cefi or a b-ansgenic plant or animal undsr ddndifena 



31 



Gondmh'& fQT pt&imt\on of ths pdypeptfde^ In: a supemataot; and or^ionafly (b) mcov^nngifie 

polypeptide. 

In the pmductlGrt niel^^ of th^: ps^ent 1r!vef"?tJpn, fiie cells are oulivst©d m a nutrtsnt 
medium suSabI© for prodMctlpn of the pciiylje^die using mstscsds known m fta aft> For esamp!®^ 

S the cey may be ^oltivated Ijy shake Hask cuitfv^tloti^ smail-scate or Mrge-scate feriift^nlatfor? 
(Irsdudins oorstmuous, Nteh, Isiei-tesch, or solici state f^rmenMiorts) in i^Jsoratpit or ihdiisSrial 
farmerstors performed In a sultaMf medium arvd under ajnditsdrjs aliswlnp the polypepfide to be 
expressed and/or !SpM©d!> the cutis/stlon takes pfase m a siiSable nutHent mi^isuBi coR^pmis^g 
carboo and fil&t^^en sources aF?d Iriior^enio salts, using procsdures known In the ert< Sultalsle 

10 fttedls are a>/aslab!e from cpmrnefclsl eappliers or m&f be prsepared; aecofdSng tp pybilshed 
cPmposftions i0.g„ In catatogu^s of tin© Amerfsati Type eulture Coliectlofi). If the poiyp^lde Is 
eeoreted Mo th& niMmt m^um> the psl^aptlde cars be recover^ directly ffprn the medium. If 
the pij^^s^ptide Is rK>t ^creted. it can be recov^ed fi*0E?5 c<©f | lysat^. 

The polypeptides may be detected usirsg methods known m the art that are specific for 

1& the po^peptideSv lliese detection methods may include use of spedfic antibodies^ formation of 
a ps"Dduct, or diseppearanc© of a substrate For exsmpie, a protease assay may ba usad to 
datermbe the activity of the poSypeptida: as descritjed herein. 

The resuitlhQ polypeptide may be reooversd by nu^thods known In the art. For exampie, 
the polypeptide may be recovered fmm the nutrisnt medium by conventional pracedures 

20 including, but not iimifed to, centifugstlon: flitratloa extraottofv, epray>-dfying, evaporation, or 
precipitation, 

The polypeptides of the present invention may bs punfied by a variety of procedures 
known In the ert. including, but not limited to, chromatography {e.g., ton exchange, affsnsty, 
hydrophobic, chromatofocusing, and size excSuslon), els^cifophoretic procedures (e>g., 
25 preparative ^soeSectnc focusing)< differential solubility (e,g., emnionium sulfate precipitation) , 
SDS-PAGE, or extraction (sse, e.g., Pmiam Pmfioatim, J.-C. Jansoo and Laj-s Byden, editors. 
VCH Publishers. Hew York, 

3d In a still furStar aspect, the present ihvenSoh relates to eomposiorss compdslng a 

fxsiypeptide of the present Inventlc^. The polypeptide composldns may b& prepared in 
accordance with methods teown In lis art and may be m the form of a ll<|yld or a 
contposftbn. For iRstancai the pofypeptida copipositlon may be In tha form of a grartulate Ois- a 
mlcrogrenylM®. 1ila |5o|ypeptlde to be Ihdudad In the cbmpbsitidn may be ^ebised in 



accordanes with mMheds known In the^rt. Examples Ms given b^o# of prefer us«® of the 
pQlypgptidss or polypeptide compositions of tt^e fnyentiors, 

s Tfm pr^^rit inv^htlors Js ai^ d(res^#d to method^ fer irslng tlis polypfsp^Ssles of the 

Inveritlon in animal fescl, as as to feed compositions and fe^ addlfivss oomprlsfng the 
polypeplldss of ti© invention. term Animal inclUEfess all animals, ^ndycilng huHiari beir^, 
BxBmpI^ of anfrriials are non-FumlnantSj and n^minant^, suqh as eows^ sheep an<:l horsigs. In a 
particular emixxJImenti tli^ animal Is a non-fumlhant snlmal. N^srHpiftilnarit anirRafe Insjlycl® 

10 mono-gastric animsis, s.g. pigs oc swine ^nciucitngi, but not limited to, js'^ets, growing pigs, and 
sow#}; pouitry such as iurKeys, f^uQk^ and <;hiok«n Ciiiciuding but not limited to i>roi|er ct^SsKs, 
layers); yoursg calves; and flsii (including but ndt iifn^i^ to saimon, tout, tilapia, cattish and 
esrps; and cfustseeans (ineius^ng^ut not lift^fted to siin^^ps ard prawns) 

Th^ term fssd or feed odm|sosttlon means any compound* preparation, miJcture, or 

16 composition suitaMe fpr, or Intepded for Intake by an Bnimal 

In the use aecordlng to the invention tiie protease can fed to animal b^fdre, after, 
or simuitaneousiy with the diet The latter Is prefsrred. 

In a psrticuiar embodiment, ti'^e pnsteas®, In the form In y^hich It Is added to tiie feed, or 
when bslng Included In a fsed addstlvs, is well-defined, Well-dsffned means that the proteaso 

2d pr^3parat!on is at least 50% pure as determined by Sise-exduslon chromatography (see ExarrYple 
12 of WO 01/58275), In other particular enibodimgnts ttis protease preparation is at least 60, 70, 
80, 86, 88, 90, 92, 94, or at ieast 95% pune as determined by this method. A well-defined 
protease preparation is advantageous. For l.f^stancs, it Is much ssasier to dose correofly to Ihe 
feed a prolease that is essentially free from interfering or coniavninaling otl^er proteases. The 

25 term dose correctly refers in pajticufar to the objective sf obtaining consislent and constant 
results, and the capabisity of optimising dosage based upon Iho desired effect 

For the use in animai feed, However, the protease need not :be that pure; It rt^ay e.g. 
Include othsr ©nzymes, in which case It could be termed s protease preparation. The protease 
preparation dan be fa) added difsstly to the feed (or usad dire^^ly In a treatment process of 

30 Vdgetabie prptelns)^ or (b) It can foe used in the ixpdMctidh of csns or more inteniiedlate 
compositions sych as feed addiferss or pre^nixes that Is sdbsequently added to the feed (or «s®d 
In a treatment process). The degree of purity dascrlbed above refers to the purity of the or iginai 
protease prepamtion, whether Used according to (a) or (b) above. 
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Profess prepafatlofis mih ipunttes sf this order of mapitude are Sn fartitEujaf obtasnabfe 
using recom&iRant methocte of production, wbemasihey ars fsot so ©asiijf obMnsd and also 
subject to a mu# Ngher b^t0Mo-bal5h vanatfof! when the p?ot®|se Is pitsduoed b|f tradlionaf 
femtenMan methods.. $uph pmiB&m preparation may of oourse be mjx©!3 with otf^er ©naym^, 
S In a p^jftlcsulsir embodiment the protease for use aecdrcSr^ to tHe Imyentiorj i$ c^palslf of 

soiubilisinp v<egMab!® prcjteins, A suttiabfe a^^ for d^ermiNng solobised protein le discioeed 
InExi^mp'te 11, 

The term vsgsisbte protains as; ys^id l>@rafn falters t& any csmppund, c©mpos|Sosi, 
praparatlpfs or rj^pum iiai Mudas at teast orse protein danyed: &'om or ofiginatlog fmm a 

10 vegetaSitei ; iBoSudins modifiad protains and protelrs-deHvatiWS. in paticular ambod!mef?ts, iHa 
protain oofit^rit d| th«i yeg^tabla protieif5S is at Je^st 10^ 20, 30, 4£S, SO, or 60% (w^^). Veg^tabia 
protains may bs di&h'\fed from vagetabia proteirs sourcaa, siidh as lagumas mraabi for 
axampis materials from plants of faitiiile$( f&bBma& (L&gsjmmsm}, Cru6ihr^cem, 
Ohmap^im&ae, and Poamae, sudi as soy bean maai, lupsn maai and ihapasead mas!. M a 

IS paiiicular embodlmifent, ih$ vagetaisl^ protete source is materia! from one or more plants of ifm 
farrtiiy Fateteae, e,g. soyfoean; lupine, p^a, or bsan. 

in another particular ertibodimeril, tha vegetabia protein source Is material from me or 
more plants of ti^e family C/fafiopod/scaas, ag. Iseet, sugar beet, spsnaoh or qulnoa. Oibm 
examples of vegetable protein sources are rapesssd, and cabbage, Soyijesn Is a pre^^radj 

20 vegetable protein source, Otiier exampiss of vegetabis protein sources are cereals such 
bailey, wheat, rye. oat^ maise (corn), rice, and sorghum. 

The tr6,atment according to the invention of vegetable proteins with at ieast one protease 
of the invention results in an increased solubillsation of vegetable proteins. The following are 
examples of % soiubilised erot-sin obtainable using tiie proteases of the invention iri a 

2S monogastric in vUm modei: At ieast 102%, 103%, 104%, 10S%, 106%, or at least 107%, relative 
to a blani?. Tiie percentage of sclubilised protein is delermlneci using the monogastric in viim 
model of Example 1 1 , The term soiubilisation of proteins basically means bringing protoin(s) 
into solution. Such solublissation may be dua to protasse-medMed rsieass of prptein fmrn other 
cdmponants of tha usuaiiy compiax natural oompositions suoh as feed 

30 In a pfthei' par^cuNr embodlmarit, f^e protasss for use accorjciing to the Inyehlion i$ 

capable of IncreesIng the emourtt of rfiges proteins. Tha foUswlng are examples of 

% digested or digestible pfotein obtainable if^lng the protease^ of tiis rover^tiors in a monogastric 
in mn mom: At ieast 104%. 105%, 10S%, 10^^%, 106%, 109%, or at least 110%, rslatve to a 
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Mmk< f he perc©rta^e of digests or digei^llaie pmtein b dMsnT^in^d using the in yte model of 
Example 11 

The foSowirig are ©xampfes cf % dig^tes^ or dips^iMe protein obtalnalste usMtf tits 
prpteasqs of tie frwentlon in $n aq«acuitMfe If? vllrom@4^f:M i$mi lOSSi, 1Q4%, 10S%, 196%, 
§ 107%, 10@%j 1pS% or at least 110%, miative to a blank; The pafeeiltage of digested of 
digestible prpteir? is ifet^^iif^dusing^^te^ Kfifmrapdei pf iExsmple 12> 

In a still fMrthef psftloular embodiments ttje protsf se loruse aeeof(|!n§ totb# mventi©}'! is 
sapabie cf Increaslrig the Pe^ree^ of Hydfolysls (t^) of yegetsbte proteins, Txhs fo!lo»ig as^ 
©xamplei of Degree df Hv^rolysIs Ino^ In a nionogsstric M \^ittt? mo«$sSj At laast 

10 1 &2%, 103% 104%, i 05%, 1 08%, pr # lesst W%> mlB^ve to a blank. Tbe PH : is «l^ermlned 
uslnig tfe rnonpgaistnc in i#?tif rnddei of Example 11. The foi|owln§ ^?^ exarnples of Pegrae of 
Hydroiy^ls Inersase obtainable in en eiEiuaculture /n vstm model: At least 102%^ 103%, 104%, 
106^,: 106%i or least 107%, r^M^B io i blank. The DH Is dMemilned usslr^ the as^uaoiijltum 
in vitro mod-eS of Example 12, 

IS In a psfticular Bitibodlrnent of a {prO^-) trfatmer^t process of ih^ Invfefltson, the proteas©(s) 

Ih question is affecting (or aeting on, or exerting Its solublljsing influejiGe on) the vegetafcfe 
protaijiS! or protain sourcas. To achieve thfe, the vegetable pmteln or protsln sourfee Is iyplcalljf 
suspsr^ded in a solvents e,g, an aoueous sosvent such as water, and the pH and temperature 
values am adjusted paying du® rsgas^d to the characteristics of the enzyme in questbn, For 

28 exampie, the treatrrvs-nt may take piace at a pH-value at which tne adsvity of the actual protease 
is st least at ieast 40%, 50%, 60%, 70%. 80% or at ieaat 90%, Likewise, for exar^pie, the 
treatnient may take place at a ter^persture at which ti>e acOvity of the actual prstesse ;s at teast 
40%, 60%, 80%, 70%, 80% or at ieast S0%, The sbovs percemsge activity indications are 
relative to the maximum activities. T'he ensymatic reaction is continued untii the desired resuit !s 

M achieved, following which it .may or msty not ba -stoppad by inactivating the enayrrse, e.g, by a 
heat"t.reatment step. 

in another particular emfeodlmerjt of a Ireatment procass of the Invantipn, the pfpteaae 
action is sustained, meaning e.g. that the protease is added to the vegetebte proteins or protein 
aQU,n;es, bdt Its solubliislng mHisence Is so to speak not switohed on until lat^ whe« desired, 
30 onc^ suitable sslMbilising condioni& are estafollish^:. or on<^ any enzyme Inhltbitors are 
Ineetlveled, or ^sfhataver other means pouid have iseen applied to postpone the aetion of the 
enzyme. 

In one OTihodlment the treatment le a pre^treatment of a«imai feed or vegetable prot#ms 
for yse l« ghimalfeed. 



TH^ tmn mprmmg the n^^rlttons! value ef art anima! ted fT5^M impmvisig the 
BvailaBiffty and/or dsgestsbility of the proteins, therefoy leading to in&Bm^ pm^in mtmcMm 
imn% the eompon^irls, jhiigii^r protssn ytelds, Increasesl protein #gf®<:^MIon an^ 
pr&Mn titisatldn. The nutritional vsiue of tie feed |s therefore iTOease^ and the anm-ml 
5 perfef?nance sych growth rate ^rsd/pr vsejght gain Sfhd/6r feed cbnv {i.e. the weight 

of Sngsstediissad relativfif to vs?s%tit gain) of th© srilmal is/are smprp^ad. 

In a p3i1fcul8f ombQdfmsntthe fead oonvwslo« ratio |s iftersa^ad Isy at i^sasl 1%, 2%, 
$%i 4%, S%, 6%, 7%, 8%, o? at least 10%, \n a furttier p^ftisular smisodlmeni iba w©f|ht 
g^ini Is IrKJreased by et Seaft 2%, 3%, 4%, 6%, S%, 7?^>, S%, 9%, tO% 0r atle^st 11%, Th«se 
10 flguf^ ar# relative to coritrbi esiperlmenfe v»fth no protease additjon, 

The feed conver^on ratio <FGR) anc^ the weight gain may he oefcuieted as described In 
EEC (1886); Directive de ia Oommlssfon du 9 avrit 1986 fixar^ ia m^hode de isdmi de la valeyr 
inilrgetlcius des aljrn^iite cort^pos^^ destines ^ la vplafile. JoMfT^ai OlSclel des Gornrfiunautss 
Eufopsennesi LI 30, S3 - S4, 
is the proteass can be added to the feed In any form, be it es a relatively pure proteose, or 

In admixture with other components Intended for addition to animal feed, I.e. In the form df 
animal feed additivee, suoh ae tlie so-called pre-mlxeslor animal feed, 

in a farther aspect the present Invention relates to composl^ons fer use In anteai feed, 
sqch as animal feed, and animal feed additives, erg, premsxes, 
20 Apart from the protease of the snvention, the animal feed additives sf the invention 

cornain at least one fat-soiuble vitamin, and/or at least one water soluble vitamin, and/or.at least 
one trace mineral. The feed additive rnay also contain at least one macro mineral. 

Further, optlonai, feed-additive ingredients as's colouring agents, aroma confounds, 
stabilisers, anlimicrobial peptides, Including antifungal polypeptides, and/er at least one otf^er 
25 eniyme selected from amongst phytase (EC 3,1,3,8 or 3,1,3,25); xylanase (EC 3,2.1.8); 
galactanase (EC 3,2,1.89); alpha-gaisctosldase (EC 3,2.1,22); protease (EC 3.4,-,-), 
phospholipase A1 (EC 3,1,132); phospholipasa A2 (EC 3.1,1,4); lysophosphoilpase (EC 
3.1,1.5); phosphoiipase C (3.14.3); phosphoitpase P (EC 3J.4.4); end/or beta-glycanase (EC 
3,2,14 or gC mi,6). 

p In a parllcular emhodiment these other enemas ere weil-de^ned (m deigned ehove fot 

profease preparations). 

Examples of antimfcrdisiai peptides (AtvlP's) are CAP1B, Leucdcin A, Tritrptlcin, 
PfOteigrln-1, Thanaiin, Deffenain, LaPtoferrsn, Lactoferrldn, and Ovisplrln mch as Ndvisprin 
(Rohest tehrar, Pleotaslr^, StatiJia, Incsluditig the compounds and polypeptides 



dlisdossd m and PGt/DKO2/0G812, as well M ■»/anante w fmgmenfe of 

above tNat retain aMrnlorobiai acivsty. 

Examples of arstifungaf polypeptides (i'^P's) th& AspBrgilkis gigaateas, md 
^spergiffm »/g^r |)ep1|des, as well as vartanlte and ftagmsnt?^ thereof whsdi mMm antsfcogaS 
§ acSklty, as dlsclosisci In 84/014S8 and WO 021590384. 

ysafly:%t- an<i vi^ter-^soyiste #arnsF?s, as; wallas traoe minerals iom |?aft of a sfCH^IIad 
prmmi. interidarf for additlpn to the feed, xj^ereas macro rrilnemfe are usually f ©isafati^ adc^ecS 
fe the feed, A (jrerntx annshad with a |Hx>|ea$a of the ir^ventlon. Is an example of m arslma! f^sd 
adcJitlve of the invention. 

10 \n a parSoiiar embodiment the animal feed addlfive sf the Invention Is iritwidsd for being 

included {or ps^s^ibed # having to be IhGftided) Ifil animal iiiets or fesd st Isvals of 0.01 to 
IQ.0%; more isarilcularly 0.05 to 6 0%; or 0.2 Id 1,11% (%; meaning g addlSve per 1^ g feed). 
This Is ^d In parfidular for premtxes, 

THe following are r^ssl-i^excluslve lists of axarnptes oftheise cdmporisnts; 

IS Examples of lat-solMbte V^^ are vitaitMh A, >^ltarpb 03, vitamin and vlamSi e.g. 

vitamin K3, 

Exampiss of watar-solubte vtamir^s are vitamin B12. biotSn md ehollne, viamln 81, 
vitamin B2, vitamin 86, niacin, folic add and pantiothenats, e.g. Ga-D~panthoib«nate. 

Examples of trade minerals ar® manganese^ alriCj irdn, oopper, Iodine, eeleriiym^ arid 

28 CDbalt 

Examples of macro minerais are csicium, phosphorus and sodium. 

The nulritsona! requiremerits of these oom^xjnents (sxemplifisd with pQuitr/ and 
pigiets/pigs) are issted in Tabis.'^ of WO 01.'$827S. Nutritional requir«mem means tiiat ti^ese 
comporsents should bo provided in the diet in tiie conce,ntr3tions indicated, 
2S in the alter.native, the animai feed additive of the invention comprises at least one of the 

indlvifiual components specified in Tsijle A of WO 01/S8275. At isast one means either of. one 
or more of, one, or two, or three, or four and so foitii up to ali tiiirteen, or up to all fifteen 
individual components. More specifically, this at least one individua! cornpon^3Rt Is Included in 
the addltiv© of tiie Ifwention in such an amount as to provide an h^feedrconoentralfon within the 
38 range i ndldafei In colyrnn four, or cpltimn ftve, or column six of Tsble A. 

The presarst invenSon also refatas to animal feed eorrspositlons. Animal feed 
compositions or diete have a relalivefy high oontent of protein^ Poultry' and pig diets can iae 
charadtedsed as mdicated in Table B of WG 01/5^27i&, columns 2«3. Fish diets can be 
char^actertsed ae Indicated ih coldmn 4 0 such fish dietis usually have 
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a crude fat contsm of 200-<310 g/kg, WO Q1/S827S correspond to US mflJBZM wU<^ is 

hereby Incorporated by refwenc©- 

Ati sfitiita! f ee<f composiilon ac^ordiiig to th® | wsr^lidn has a cryiiJe protem content of S&- 

SOG 0^kgv an4 furth0rffi<>m <;Qmi5!ls©$ at laasS one protease as cteimessl her^iin. 
§ rurtherrnom, pr In the aftemaSva (to the c;r«# pmt^in content mdicssfed afeoye), tsf 

anfmal feecf composition of the Invention has s content of meta&oibaNe energy df lCS^ 

andfqr a cor5tent>f calcium of 0,1 -SOO giku> and/or a contiSM of ^V^Hebie piid^hortiis of 0,1-390 

g^g; mMlm a cqntet of ni^hloning of 0.1^1(50 and/or a content of metNanlne plus 

e?^slein© of O,1-*1S0 g^g; and/or a coMerU of I'^tm M g/kg, 
10 in particular embodimarjts, the content of rtiietaboflsaMe energy, crude pi^tein^ ealclum, 

phissfi^orus, mefjionine, metlilpnlna plus cystalna, atidfer iysine is wlSisfS sny one of r^nge^ S, 3, 

4 or 5 In tabls B of VVO £51/88275 (R: 

Cnide pfoteifi Is caicui^ted ass nitfogen {N) irjuitipii^d by a faetof 6i2S, la. Crude prolelr^ 

W^9}~ ^ (9^0) X ^-^^i The nitrogen content is detenrsinfed &y the Kjeldahr rrtsthoci |A.OAC.> 
is 1984, Offsciai Method of Analysis 14th M., AsspdatSon of Officiai Anaiytloai Chemists, 

Washington DC). 

f^etabplisabie energy can foe caicuisted on the basis of the HHQ publication ;N5.ilrfent 
rsqulremsnts In swfnes ninth revised edition 1988, suiaconTOittee on swine nytrttion, commitse 
on aniniai nutrition, Srfoard of agriculture, nationai research council. National Academy Pmss, 
20 VVashingtsn, D C,, pp. 2-8, and the European Table of Enengy Values for Pouitry Feed-stuffs, 
Speiderhoit centre fcr poultn.'" research and extension, 7361 DA Beekbergen, The |s|ett)efland$,. 
Grafisch bediijf Ponsen B icaijen bv, Wacjeningen. ISBN 90-71463-12-5. 

The dietsn/ ccntsnt csf caesium, availat>le phosphonjs and amino acids In complete 
animal dfets Is caicusalsd on the basis of feed tabSgs such as Veevoederlaisei 199?, gegevens 
25 over chemische samenstsliing, verteofhaarheid en voaderwaarda van voedemilddelen, Cenlral 
A/eevosderburesii, Runcierweg 6, 8219 pk lolystad. ISBN §0-72839-1 3-7. 

In a particular embodiment, the animal feed composition of the invention contains at least 
one vegetable protein or protain souroe as deftr^ed atKJve. 

\n still fufth^ psirtiouiar emfoodimants, tue animal feed composition of the Invention 
SO cofjiteilris 9-80% rn^ixe^ and/or &-S®% sorgtium; and/or 0-70% wheat; and/or 0r70% Bartey; 
and/oi' 0^30% oats; and/of 0-40% soyfoaan maai; and/orO-10%jshniiealj and/of 0-20% whey. 

Animal diets can e,g, foe manufaetured as masli feed (non psiioted) or pelleted feed. 
Typically, the mlllM feed^stuis are mix&i and ^^slent amounts of asssntlsl vitamlhs and 
rninerals are addaci ^ccdrtllng to the speoiffcations for the species In qu^tlpn. Enzymes can be 
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added as solid or Hqjuid enzpis fdrmaSaSsffs, For exsfftptei a soK enzyms fofrnuMte Is 
typically added before or duHrig tie mmn§ step; sod a Kqulcf enzyme preparation Is typically 
addetf aft#^ the peSeSng stsp> The enxyms may a!ss ise meorporated m & feed additive or 
ptemlk. 

S Th^ final ^nxyme conoentr^iQn m th^ diet pthin the rgnge of 0,01 ~2Q0 mg ^^xyrne 

pmirni prn kg dtet^ for ex^pi^ in the range of 0,S^2S gn;^me pfotan per^ kg animai dlgst 

The proteasss MkoiiM of cotirse be applied Iri an effective amoMrsti j.e< in asi sinlount 
^4&qmt@ for Improvlrsg soiybiSsation an#or improving nutritional value df f^d, It is at present 
oc>MemplatM tlist the ssisym^ is g#iinMer^ m om or m^f© of tte following arhoiihts fdosaps 
ia mng«s): 0:01-2PD; a01-1Q0; %S-1Cm; 6-100; 10r1»; 0,S5~S0; or 0.10-10 - all these 
ranges ij^ng lii rrsg protease er^yme protest per kg fesd (p^rfi). 

For dstermfning mg era^fma protein per kg feed, tha protease Is pyrlfisd f fern the feed 
#mps>sitbiii, ari<t th^ specific ^eliv^y of ihk puriied pr^otease is determfh^ mng a relevssht 
assay (see tinder fs'oteasie activity, substrj^tes, and assays). The protease activity of the feed 
IS Gomposltion as ssfsi^ Is a!sd determined ussng the same assay, and on the -basis isf tHsse tvsfo 
delermlnatlonSj the dosage in mg eh^'me pmteih per !<g feed is caibulated. 

The sem© pHnoipies apply for determmlng nig enzyrrse protein in feed additives. Of 
course, It a sampis is available of the p-ote^se $ise^ for preparing the feed additive or the feed, 
the specific activity' is dstsrmined frorr? Ihis saf^pie (no need to punfy the protease: from liie feed 
20 composition or the additive) . 

The present invention is further described by ths foilowing examples whicli stold not be 
construed as Smiting the scope of tt^e Invention. 

Detia;^ent Compositions 

25 The protoass of the invention may be added to and thus become a component of a 

detergent composition. The detergent composition of the invention may for example be 
farmulated as a |-)and or mad-sine laundry detergent composition including a iaundjy additivo 
oomposition syitabie for pre~treatment of stained fabrics ar^d a rinse added fabric sofisner 
sonipositlon, or be ferwulated s determent compesiSon for use In general hoesehofd hard 

SO surfade deaning operations, or be forrrtiflated for hand or machine dishwasNng pperaflons< 

In a specific aspect; tbe Inventlcn provides a determent additjSve eompfislng fm protease 
of th^ Invention, The dstergent addltlvfe iss vveil as the detiergent composition may comprise one 
c«' rnore other ensymes sueh as another protease, siich as aiicaltne proteases from SacOlus, a 
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lipase^ a cullnass, an amvlasa. a carfeohycteses a ceJlulsse, & ^mMmm, a maftnahasej ari 
afablnase, a galactanase, a xylarsass, an ©xidsse, e.g., a laosase, ancH^isr a peroxieiase. 

Iti g&mf&i tM pmportlm # th© chosen enzyme(s| ghould ts© cofr)p|lble ®lth fh© 
sesisdad determent ^Le, pH-optimym, compatifei!% wsili otbsr enxptiatic m4 non-enzymaflc 
$ ingmdisrt«, etc,), as^d estsyiiieC^I shoMfd prssf lit in am>unt$. 

Syltabis lipases lodude ti&m &fhmMm\m tegai sdgifs, Chemical!^ meidiisd orpfotsifs 
mgltmer^ mi$mU ar^ jnsilMd^d Bjsatspte^ of u$^ul lipases induda lipases; from Hmvo&iB 
(syiioftym Tj^emjomj'sses)* a.g^ from ilmu0m$s if. hmsgmmm) as descrlised in 2SS0$8 
arid £P 305^10 Of fmm H. /n^/e^j?^? as desprifoed In WO §6/135S0, a Pmu^mcB&s ispasf , a,g, 

IS from P, ^c&fsg&nm m P. psei!<fmhi^^g0nBS |EP 218272|v R mpaci& iBP 331370), P, $Mmfi 
C6B 1,372,034), P. Juoremens, Pseadmmss sp, stmin SP 70S WQ 9S/06723 and V^D 
06/2?002)^ P; m§mhslri4mls i^Q 96/12012), a Sacftws ispase, fe g^ from B. s«M//s (Daftois el 
al {ISM), Slopilgmica et Biophysiea Ada, 1131, 2S3-36S), B. ^^WQth&mt^kilm (4P 
64/744992) or B, piimliis (WO 91/16422). Other exaiVsples ^re lipase vatiants sucsll m those 

IS descfibed in WO 02/OS240, VVO 94/01541. EP 40T22S, Ef? 56010$. WO 95/35S81. WO 
96/00292. WO 96/30744, WO 94/2SS78, WO 95/14783, WO 0S/2261 S, WO i?/0407S and WO 
97/07202, Prsferred commerdaify avallai^e llpasa enzymes Ihciud® Updase^** and Uppiase 

Suitslste amylases (alpiia- and/s^ isela-) imitida tsose pf badenal or fungal origin, 
20 ChemiGsHy modified or protein engirifeer-ed mutants are irduded. Amylases include, for exsmpis, 
alpha-amyiasea obtained from Baciibs.. e.g, a spedal strain of S. ^ichmtifomis, descnbed In 
hiore detail in GB 1,296,839. Exam.pies of useful amylases are \he variants described WO 
94/02587, WO 94/18314, WO 96/23873, and WO §7/43424, especially tile variants wife 
substitutions in one or more of ^he following positions; 15, 23, 105< 108, 124, 12S, 133, 15*4, 1S8, 
as 181, 188, 190, 197, 202, 208, 209, 243, 284, 304, 305, 391, 408, and 444. Commercially 
avasiabls arnyiasss are DuraniyP\ Tarmamyi'-'', Fungamyl'^ and 8AN'^' (Isiovozymes A.''S), 
Rapiciase^''''' and Purastar^''- (from Oenericor International inc.). 

Suitable celiuiases include thos>3 of bacterial or fungal origin. Chemically modlffed or 
prntein snginesfed mutants are included, Syitable eeiiulases include oeilulases from the genera 
SO Bmllfus, P^mdm^ Humf(^a, FmaHiim, MsB^s^k Acfimmtutn, e g, the tipg^l celluloses 
prtKlucad from riuinieol& Jnsoiens, Myceliopliihom them^phM and Fismmm oxyspomm 
disctoeed in US 4,435,307, US 5,Q48,aS3, US 0.0011 7B, US S,776,?57 and WO 89/09259. 
E^pedali^ sustade cellulas^ are the alk^fne or neutral cellulases hmn^ cdidar cars benefits* 
Exampfes of sueli ceiuiases are cdiyiases descril)edi in EP 0 40&2S7, EP §3i|72, WO 



96/11262, WO 96/2930?, W0 M/0884D. Other examp!^^ are celluSsse variants sych as thoss 
<lescr!bed m WQ EP 0 S31 SIS, US 5,4S7,046. US S,688,593, OS 8^763,254. WO 

05/24471. WO 98/12307 and WO 90/01544. Gommsmyiy avsslafels cslful^es ioelude 
Cdluzyme^*', and Car^zyrtie^''* povazjR'nes C\smnm^^\ and Puradax HA^^"^ {<lan©i?ia?i' 
S liitemalMnal \m.% and KAG-SOOCBf^ {Kao Cosp^mtton). 

SuSabte peroxldla$eSifp3!iiias©s Jncliiie those of {Slant basienal or fungsil origin. 
Qhemicaily m<>c!ft0d or pfotefr) ehglrieei^ mutaf^ &m inclMd^v fxajtjpi^s of usefyS 
I^re)sl#s8s Induce p«roxldgse$:from^^C^ from G.; cimmmf and variants thereof as 

those doscsfjIsecJ In WO 93^^13, Wp 95/1Q8Q2. and WO S8/1S2S7. Commefqi^My avaiSsb!© 

10 |sgro)d#isesJndude <^im4^m&^" f Noyoaymes). 

Th^ d0te?geftt «RiKyrns{s) rnaif Ba Inelyded m a d^t^t^est eompiositiorj by adding 
separate additives eorttainihg one or more enzymes, or by adding a coitiMhed additive 
cornpifislh^ ail df th^e enzymes, A sifetengfent additiw ih^ Irivintidhj is; a separat© ^ddftive of 
a combined addiSve, can be formuMted e>g, as iss granyfats, a iSquSd, a slurry, etc. l*rsfefTed 

IS dstergant addttiva ibrmdlatea ar© granulates, in particuiar nors-diistSrig gf^nulates, iiqUlds, in 
partlGular staSsiiked ll^^uids, or slurries. 

MDsvdussting granulates may be prcxiuaed. e.g., as disclosed in US 4,106,991 and 
4,661 ,4§2 and may sjptionaily be ooated by methods S<nown in the art, Exsmpies of waxy coating 
n-Yaterials are pdlyfethyiene oxide) produces (poiyethyleneQlycol PEG) with mmi\ n'wlsr weights 

2(5 of 1Q00 to 20000; etho.xylated nonylphenols having from 16 to 50 etiiylene oxide units; 
ethoxyiated fatty aicosiois in which ti's aicoS-ioi contains from 12 to 20 csrbon atoms snd in which 
timers -are 15 to SO ethylene oxide units; fatty aicohols; fatty acids; and mono- and ds- and 
triglycerides of Mty acids, Examptes of ftim-fomiing coating materials suitable for application by 
fluid bed techniques are given in GB 1483591 Liquid enzyme p,reparations ntay\ for instsnce, be 

IS stabllfeed by adding a poiyoi such ae propylene glycol, a sugar csr sugar alcohol, lactic sc;ld or 
bone acid according to established .methods, Protected enzyrjies may be prepared according to 
thss method disclosed in EP 238216. 

The detergent composition of tbeif^vention may tsse In any convenient form, e.g., a bar, a 
tablet, a powder, a granule, a paste or a liquid, A liqyfd detergent may b« aquaouis, typically 

30 obhtatnlrjg up to 7Q % wM^r and % nrgsf solyartt, or noh^gbeot^. 

The detefgent composition oomprisas Ofi© or more sijrfac^an^v whioti may be non-bni^^ 
induding semi-polar and/oi' anionlb and/or catlonic and/or ajgsfitteflonis. The surfastarsts are 
typically present at a levai of f oiii 0.1% to $0% by weight 
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Vyhsn IrnGiydsd thefain tsa dsterggsrst will usyafly eonlain from about 1 % to a&out 40% of 
an arslsnic surfec|a«t such Bnear slk^benzesiesMlfonate, aipha-olefsnsuifonate, aikyi sislfete 
|fa% slophoi sulfgst®)j aleohol ©thoxysulMe, se?:bt"sda?y alNanMuStoatSs s|pha-suifo fa% acid 
methyS ester* alkyt-'Orafenylsuqcihteac^iii or $oa^, 
5 Vyhen inc!u?l0d ih0^in tte ct^tef^ent Msu^Ny oont^sn frorn ^fooyt0.2% to sbout 40% 

of a nofv^Mc surf^ alcohpi etip)Qfate, rsofsji^phanol ethoxylate, aSkylpolygiycosiijle, 

r«onosthanolami<^% pofyhydfoxy alkyi fait^ add ar«idfe, or N?-acyl N-aikyI darwatjves of 

1 0 The detet|@nt may oontssn Q«0S % ei a defergent builder or csmpfeJiing ag^nl such as 

zeol!t6, diphosphate, idphosphate, phesphonat^j oartjsaiisft©, cilf^tss, rtitriiotriaostic MM, 
etiiytefs^diamlrsst^tfaacietie; mU, diBti^ehMrfamitiepsntaacstiG add, ali^yl- or alkenylsucsinis 
aqsd, solufele si!lsat# or layered $j}tt#e$ (e.g. SKS-0frpm Ho^chst), 

Th^ detergent may somjsrfSis one or rt\om poly mens. Examples are 

IS carfooxymetlsyteeiMqiSe, poiyCvinylpyfroiidonfe), poly (ethylene glycoi), p£j!y(viiiyl atcohd), 
|)o!yCvlnylpyridlnss-lsl»o>rlde), poly(yinyl!mEdaz:ol8), iioiyoarboxyiatss such as pdiyacaryiatM, 
mafeio'acr^'lfe acid copolymers and fauryl methao^ate/aof^lp aoid ccipD!y.?T!®rs, 

The detergent may contain a bteaching systefr; which may comprise a HjOs source such 
as perborate or psfcarfoonate x*?hich may he combined vvlth a psracid~fbrmin>g Isleach sstsvator 

20 such as tetraacetySethytenedsamine or sionano-yloxybenzenesulfonate, Aiternalively, the 
bleaGhing system may cornp.nss percxyacids of e.g. the amide, imicfe, or sulfon^e type. 

The ensym0(s) of tiie detergent comp-osiiion of the invention may be stabsiized using 
conventianai slsbifeing agents, e.g., a posyos such as propylene glycol or giycerol, a sugar or 
sugar aicohol, ladic add, boric acid, or a boric add denvetive, e.g., an aromatic borate ester, or 

25 a phsnyi boronic acid denvativ© such ae 4-formy!phanyi boronic acid, and the composition may 
b« Ibrmuiated described in e,g, WO 92/19709 and WO 92/19708, 

The detergent msy also contain c^hsr conventional detergent ingredients such as e.g. 
fabric conditioners including clays, foam boosters, suds suppressors, ar>ti~corrosion agents, seii- 
suepending agents, antl-soi! redepo§iiion agents, dyes, isac^wlofdss, optica! brightaners, 

M hydrotropes, t|mis!) thhibitorS; or parfemes. 

It is at presei^isorjtemplated fetin tiie deterged enzyme, in parffCiilar 

the enzyme of the sfiyentiof?, xmf bp added in an anmint oofresponding to 0,01- 100 mg of 
e«:Kyma Jsrot^b per Kiar ol t^siN Isi^r, prafarabfy O.dSrS m§ of en?yme protein per liter of wash 
liqoor, In partiouiar 0,1-1 mg of anayifss protein per liter of wash llcfour. 
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The enzyme of the ifjyerjtion may additionally tee Incsifporsted in tsa detergent 

The ffsverition descntssd and claimsd fiemsn is ndl to lis lmM6 in scope by tte ^pscle 
smbodimerts hereiJi disclossd, t}l®s@ embsdim^R^^ ar^ irttsnded ifeslm^ons of severs^l 
5 ^pesla of the iiiv^h^on. Any eqyfvslerift embckjiments intended to fef within IHte ^opa ©f 
this Irsv^iiUon, tedeed, various modicstfons of -the bvenSors in addltisn Is those shown and 
desorifeed herein wiil feffsorm ^app^r^nt to thcssie sklii^ m art fi^om the foregoiiig das(s1pfi«. 
Such modifscaticsfis siso intended to fai( within the sdop« of #!e appended claims In th^ 
past of confiici the: present di!^io0^^^ definitldhs will con^mL 

10 Various references are cMsd hereiftj the disclosures <rf which m& Inmrp^tBlt^ bf 

referent in thar entir#ies. 

Materials anci mi^8i«ds 
1S Strains: 

BbgHIus mbtilm PLi Wi (Did^Hcten, B st ai, iSSt), donlng of sidB, which encodes 
aspna~«s©tDfa>rfti3ie oecarooxyiase, an eKoer^me trom a^omuh Qfev>% v, ssactenoii 

28 Badilus sabBs PL359S-3? 

BB€m& sutms MB 1510 

Bsafe sydMjs PL2308- This strain is the B,sisbms DIS11885 with disrupted apr and 
np,- genes (Oidenchsen. B, , Wedsted, U„ Hedegaard, L, Jensen, B. R., 8|0hoim. C. 
(1900) Cloning of sidB, which encodes aipha-acetoiactate decarboxylase, an 
m 6xoenzyme from BadHus brevis. J. Sactersoi,. 1 72, 431 5-^321) which is also disrupted 

sn ths transcriptional unit of the known Saci7?(is subtliiB ceiluiase gene, resulting in 
ceiiulase negative celis. The cilsruption was pe?formed essentially as described In 
(Eds. A,L, Soo©Rsh#n, JA Hoch and Richard Losick (1983) Badilus subtilis and other 
arara»-PositiVe Badeda, Americsn Socl^ fc^ micfelslclogy, p.SIS). 

30 

i^8f¥§st 1 .5 mf cuiture and reesispend i?^ 100 pi TEL. leave at 3TC for 30 msn. 
Add SOB pi thioopate feuHer and ieave at rmm temperatisre for i S min. 
Add 250 pi i^H4Ac and leave # ice for 10 mlh. 
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M4 Q\h md mix 

Transfer to a nwocsntriftsg© and spin for 1 0 m|n, at full sps^. 

Transfer sup^mstaMfe a new Eppa«(Jorf tub© wid acid 0,S4 viOl^^ cold jsopropajlql Mix 
Resuspend the senamic QHA m 100 TER, 



IE: 10 my mmCk pH 7.4 

1 EDTA, pH 8-9 
TEL: 50 m^M Lys©2^m lii TE-Mer 

io6 my eWta 

QM%^ll^iv^^H^^ s-bdium sail. 

60 g thbcyarmts, M mi 0.6 y ^DtA, 8;0, 20 N20 
dlss^Jlvas at Cool down Rt srid add 0.6 g N- 
iaurytesfftosine.. Add HSCS tb 100 mi and filter It through a 0.2 p 

HH4m IS M CH3COONH4 

TEH; 1 [.sgfmi Hnase A in TE-biif?er 

CIA: Chtorofsrm/tsoaniy{aJ£!ohoi24:1 

10 PGR fragment can be puril^eci u^sng GFX PGR DNA and Gel Band^'*'* Purif fcstion Kit 

(Pharrnacia Biotech) accc>rdi!>g to the manufacturer's Instructions. The riucleolfde sequences of 
the ar^iplified PGR fragments are determined on an A8! PRISM"^-' 3700 DNA Analyzer (PerHIn 
Eimer, USA) using 50-1Q0 ng as template, the Taq deoxy-termsnal cycle sequencing kit (Perklo 
Elmer, USA), fiuorescent labeiets terminators and 5 pj^wi of ths se-^quencing prirner of choica. 

1S 



TY: (As deSQfib^d In Ausyb^i; F> # Cfds.) ''Current pr^toccsb fh Mofepylipif Biology?", jQho 
VVItey and Sons, 

LB agar; (As desc4bed In Ausubel F, M. et feds.) "Ciirrer?! protocols in Molecular Biology ". 
2D Jofsn ^^Vltey and Sorts, 1 99S). 
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LB-PG agar; is LB agar supptementecl with 0,S% Glucose ansi 0MB U pstasslum phosphate, pH 
7.0- 

s prcstes$0 is ffie:3§ym?l usfng tho PNA assay with sMccinyl-alanlriS-alamoe-^ 

ts8 PNA assay Is descnfeisd sn Rothgeb, T.M.^ Gopdiafiflef/B.D., Garifsort, PM, m4B^t% LA, 
Joyrna! of ths Ameilcan Oil Chemists* Society, Voi, 6S (S) p|>, S06-S10 (1988). 

10 Qm& ^pressiQft In BstcMm sUb^s HtM 

All tiNe mptmmi gaies in th* foltowbg exampias are Integrated by homqlqgous 
rseombinMsHi oh the Badflus syMlfe hdst df^il genbms. The gen^s aire esqsressed uniler th® 
Gooft'oi # a triple pmrmier described in VVQ consis^ng of tiiie promoters 

from Ba^l^m iish&nif0m& ajphs-amylase gfehe (BmyL), BaciHus amyiofliijdefad^s elpha-^ 

IS amylase ^ene (amyQ)j and tiia Baciiim ihuiingi&nm sryUiA prtsmoi^ including stolfelng 
sequence. THe gsne coding for CliteFampiieRicoi seetyWransferass was used as maksr. 
(Descn}>e<! if! eg. Pfdenoi^en^B,; P<>ylsien,<3,B,; JoergeiiseR,S,T.; A osefu! cioning vector for 
Baoiilus subtlils. Plasmid 30;312 

2Q Example i/Corsstruotsoa of syrjthetlc iSR tali-variant genes with Savlnase slgr>al 

A sysithetic 10R gene (10RS) encoding g S2A protease denoted 1GR from 
Nocsrdiopsis sp< NRRL 1B^.62 (WO 01/SS276) vvas constructed which iras tire nudeotid© 
sequence $bmn m SEQ !D NO: 1 , This synfnetic gene was fused by PGR in frame to the DNA 
coding for the signal peptide from SAVfNASE'^'* (Novozymas) resulting in the coding sequenos 

2S Sav-1QRS wiiich is sliown in SEQ ID NO: 2. Several tail-variants of this construct xsjere made. 
Compared to the Sav-IOBS protease encoded by SEQ ID N0:2 the tail variant construct Sav- 
10RS HVO was constructed to have 8 amino acids extra in the C4erminus; QSHVQSAP (8EQ 
ID NO; 3) which v»<>ere encod^cl by the foibwlog DMA sequence exter^sion Inserted in front of the 
TAA stopcsdon of SEQ ID NO: 2; 

30 (SEQ ID NO; 4); caatcgcatgtteaatocgctcca 

Tail variant Sav»10RS HV1: was constrycted to have 4 amino adds extra in the C- 
lertTHlnMs; QSAP {SEQ ID NO; 5)* ts'llti itve fpildwihg DNA seiijuehi© Extension insefted in froniof 
the TAA et0p0odoti: 



(SEQ ID NO: 6); castcggctcd; 

TMl variant Bm-I ORS fW^ mm opmtrufefed t© hav0 2 amSno acids extra in th© C-^ 
terrnfeus: OP {UBQ ID NO; 7} with following DNA se<:5uencs 0x|er)ss^^^^ 

(SEQ 10 NO: 8}: caapea 

Tai^vSRantSsVf-IOR^ HV2 vMaiS coristeoted to have orse ^tmm add exiFa Jfs the C- 
t^nilnus: P ($EQ ID NO: 0) vj^th the lolloa^fng DNA sequence pension Insefled In froni ql th© 

(aSQ ID NO: 10): oca 

The iOR® gjefiie and tf?e fa«r tsil-vefiant ersc^ing genes were mli^rsited fey 
hornoibgous mcGmfeiiiatbn into ttjeSa^fes yB19S3 ho^ ceil gi&nome, CfiloramphsriiasI 

is resistant transforman^s we checked lbr protease activify pf> 1% skim mlSk LB-P<3 agar pMm 
(mpptemmi^d vsrfih 6 pg/ml tlitoi-amphenlcol), Sorhe protease positive cGlonies; v^^e further 
^nalyxed by E5N.A ssquencing af the msert to ensure the Gorrect gfene DRA se^^uence, and Hve 
strains, sach coraprising one of the alsove eonstfiicts, were fleeted and dertoisd, respectively: 
S.s#tfe Sav~10RS, 8.subms Sav^lQES HVO, B,mbms Sm-IORB HV1, B,mbm Ba^~iQRB 

20 HV2 and 8.mbms Sav^lORS HV3, 

Exampla 2. Fsmiientation yields of 10R tail-variants with Savlnase signal 

Fermentaticsns for the production of the tsiS-vanant enzymes of the fnventson were 
pfertonned on a rotary shakirig table In SCO ml bellied Erienmeyer tlasks esoh contasrilng tOQ ml 

26: TY supplomenteci wsth 6 mg/1 shlorampher^fcoi. 

Six Efisnmeyor flasks for each of tha five B. subtiHs strains from exaniple- 1 were 
fermented in parallel. Two of the six Erienmeyer flasks were Incubated at S?"C (250 rpm), two at 
30"C (260 rpm), and the last two at 26"C (2o0 rpm), A sample was taken from each shake flask 
at day 1, 2 ar^d S and analysed; lor proteclytio activity. The results are sh&m In tables 1-3. As it 

30 ca^l be seeli irorR tables 1 4» the elieot of tbe larlS: is a surprisingly high Impfovesment on the 
expression level of the protease, as measured by activity IrV tse cuMrs broth, the efface most 
pronouncsd at 2S*C and SO'G, but Is also evident at SF'C as art effect observed especially at 
the early stage ef tbs ferrfsentatson. 
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Table 1; Rslativs protecsSytic activities at 37"C. 



1 Dayl 


Day 2 i Day 3 j 


Sav»10RS 


I M 




1,0 j 


Sa\M ORS HVO 




o.t „ 




Sav-IORS HV1 


1 4J 


1,3 


" iXj 


8ay40HS HV2 


i 2 a 




0:4 1 




J 5,3 




1,7 1 



Tabte 2: Relative proteoiytio activities at 30*C. 



1 Dayl 


Day 2 


OsyS 






1 10 


1,0 


t , V 




8RS HVO 








Sav-i0RS HV1 1 4,6 


ai" 






ORS HV2 


1 2,4 


1,9 


" 2,3 




HV3 




3,0 


_ 4j4: 



Tibbie 3; Rsi^tlv^ pmteolylic a^sHvltie 






_ „ 1 Dav'1 


Dm 2 \ 






1,0 1 


10 1 


Sav~iSRS~H¥S \ "iS 


2,5 I 


3,i 1 


Sav^lRSHVt 1 2,6 


3.6 1 




Sav-10RSHV2 | 1,8 


2,6 1 


2,8 1 


iiv~fiiIlHiv3 1 '"23 


3,5 1 4,6 1 



TSie foliowfii^ con$irMct iva^ usacf fpr the ehromi^pmd Me^^ of th^ tail-vanshi 
10 encoding gsnes. The coding sequence of the we!!~known subiiiisin BFN' protegee was 
opa-atiejially iinked to a tripfe promot^j a marker gen# was fu$&(^ to this Ca speotmdmycin 
maistahcs gene m^rrourtded by resoivass ma^^ltesj, and pediata lyase enoorflr^ gen^ from 
Baciiiiss subWm -mm fused to the construct as ^arikihg segmahts conipHsiiiQ the 5' 
ipoiynaefeotjcia ri^ion upstream fyfmD-ytmCj-yfmB-yMsA-Pai-start^ and the; 3' ipolynucfeotida 
1| ifeglof? down^tt^m |F^f-0fid-yilS~cit$(stsft)] of the taii^-varlant encoding |k)|ptuc!adty©, 
fespefelixtely, ths Iritsgratlonal oasisette was made by tlie joiftlng of savarai dMf^mt PGR 
^agmenb- ,AtertiSi© final PPR raadttontha :POB producj; us^d fer fensfooti^tlon of mturajly 
compatant B, saMffs eaife Ot^e (Aom denatsd was selected and confirmed by 

saqutiieing to eoolain the coiractcsjnstruat, 

20 

The Pl3598^3Tdo?i0 thus osntafr^s the 



1 , The ■SanMng regions 1 00% Homoiogous m0m of th® B^siibtls gensme (apj|S^ars as 
the upstream tagmeft ytrjiO-ytmC-yfrnB-yfR^Feisfert ari«^ |h;@ filowR^tfeam feagmsntt 

2. The Sjp©eSriomyoin mslstsinee gene figriKeci by Resolvass $ites {ms}.- 
g 3. The tnpie promoter region plM$ €]fy//iy^niRI^ «^Hiefrig iead^^^ 

4. The BFN* Op©f5 Reading f^ram 

gomtructlioft of t rij^te pmrnoter BfiN^ eassaltfe 

A PC?^ fr^giTJent corsipflslRg the Intagmtlonal cs^iset^^e far s BFM' iiferar^ 

1 0 constructed; tiiue Dpembly linkliig e tripSe promoter (es «lescribed in VVO 9SM383S; Ndvo^mes) 
to a BPH' exjsres^lo!^ c;:si$sette INjm a Baaf/(/^ ©trsuri. The^lple promoter {$ a feslop of an 
Optimfeed arnyt-Jferived pribmater (sas shown m WO §3/10240; Novo2;ymes) with tvi'o 

prornbies^ scBAN ^fid cjyfiL^ the first is a ossisensys v^rissoni of the BmBkm 

aitiyiG^filuBhams amyi^m BAH promoter, and the tatter snofodes a mRf^A-stefe'lising sequence 

ia (as tecnbed h WO 98/4$S3$; Novozymes);. S«itaMe prlrners can N ^^erlved frprrs the pufeiicly 
avaslafole sequences (Vasantha, N, at af. Gensi^ lor alkaline pnste^ise and netstral proteose from 
BacWus amyl&isqisefaomns contain .a large opm reading frame between the regions coding for 
signal sequence and mature protein, J, Bacterlol 1SS;811 (1984) EMBL: aceesslor^ No 
K024965. A Kpnl and a Sal! restriction site was introduced to flank the FCR fragment at eacii 

20 end, usihs the prin>ars; 



#252639 (SEQ ID NO; 11); catgtQcatgtgggtaco.gcaacgttcgcagatgctgctgaagag 
#2S18S2 (SEQ !D NO: 12); catgtgcjat^ggtii^acogattatggagcggattgaacatgcg 

2B Ths Kpn! and SsN restrtction slles In the PCR fragment were subsequently used to 

clone the fragmerst into a Kpni-Sal! digested Ped-Spec PCR fragment. The Peel-Spec ftagment 
CQmpnsas a Spectinomysifi resistance gene inserted In the middla of the S,«a5fe- Pecteie lyase 
gene plus approx, 2.3 Kb of upstream genomic DMA and approx. 1,7 kfo downstream genomic 
DMA, The P@cl~Spec fragment was produced by PGR an->pSfficatlon of genomic DMA from the 

30 S.jSwMI/s ^trslin MS1p3, using the priniers; 

#l t0541 (SBCl ID NO: 13); gGgttgssgaegcgcggccg^gagcf ccgtltggctga^^ 
#179542 ID N0: 14); gcgttgagacagctcgegcsagggaaaaatggaacQgelttttc 
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Conslfugtion of ^81053 

The ^B1dS$ 8.stib§im strain eonstryeted fey deie^or? of the pectatelyass (Pel) 
gesn^ thrpugh Integmtbri of « PCR pro^uot irtlo a wild-lype B-suMiiiB typestrain g6noRi#. Thfe 
was acbieyed fey s Ppf? ampiiication of genorpc DNA dliieotfy dcjwnstrBam and upsteam ctf 8i® 
5 FeiSate lyase gen© ofthe B>mM% 

Th^ ends of tha f gnomic dlE«fC^ly preoeding and pmee^lng th@ l^i^i gen§ wsm 
siong^t^ thfOMgli pfimer insei^on «f ^^qusncss Iseing 100% homoSogoijii? to P!#i sequences 
dMin^ed by file ends of a tNrd fragment encoding a marker ge»)e suTO by Resotvase 
(R§s) ${t©s, fu this pgrtjoMl^r c^se ttio niarker g^m {$pec) conferred! resfskncs to 

10 sp^tloomycifi, and it was atuated betwieen tiisfo Res sites, aitogsther ptBmni on Ih© piasnHd 
pSi3S8 |dascfit?ad In US patent 5,882,888), ThreiB d%r8r>t RCR frsgnisnts were iniilly 
producsd. 

FragmsM 1: tilis fragment covers from the yfmD gene to Sie mMie of tiie Pe! 0ne and 

11 intoduses an overhang to the Res-Spec-Res cassette at the Pd gene. Th size of fragment 1 is 
2.8 kl3. The frngnient was produced by t PCR ahipliticate eiiromdsoifts^! B¥4A Mm Sse 

strain PL2306, using th^ prfmers; 

#179541 (SEQ iD NO; 13), and 
20 #1 7@S39 With overlap to #1 791 54 Spec primer (SEQ ID NO: 1 S): 
GcatttgatcagaaKcaotggccQtasititacaaccailgcggaaaata 

Fragfrsent 2: this fragment covers from llie nilddfe of the Pel gene to afer the end of the CstS 
gene and inlroducing an ovsrhang to the Res~S?5ec-RwS caesstts at the frsiddle of the Pel gene. 
25 The Size o? fr£5gm«r?t 2 is 2.3 kb. Ths fragn^e.nt was produced by a PCR amplification of 
chramosomal DMA from the B.suijfe stfaisi PL23D0, using tlia pnmers: 

#179542 (SEQ ID NO: 14), and 

#1 7SS40 with overlap to #1791 53 Spec prlrr^a- (SEO ID NO: 16); 
|0; ^gpldcagatctggtacocgggtctagagtcgaegcggcggttogcgtcoggacagcaM 

Fragment 3; this fragment eorslalns tbe Spasslinom by Res sites and DNA 

sequsfices lit the ends overlapping With PCR fragment t and 2. The sli<e of fragment s Is 1.6 kb. 
Fragment 3 was produced by iPCB arnpflfication of plasrnld pSJ3358, using the pnmefs; 
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#1791S4 (SEQ ID MO; ITy^gttgtaaaasfaegice^tgaattstiatcaaatgg 
#179163 (SEO ID NO: IS); ccgcglcgacactagacacgggtacctgatotagatc 

Sta ndard sond iti ong f 9 r the PCR magte 

F<>r the PCR ampilealfons of frngmerit 1-3 the HSFI Expan*^^^ sf®^^ {Ro<^0) 
was used to§#h0ptMth 8i8i foitowin9 CY<Mn§ scheme: 

To this miK 3 |i! of D^iA {apx, 100 ng) atid S.TS pi Hnz^e mix (use hot start) Is added. 
Tdtail volurne is 50 \il 
Thecs^ciing pfoflieJs: 

i cycie of 120 seo est 04'C 

10 cycles of 15 s^s; at 60 sec atBO'C, 240 sec: at 72 ' C. 

2a cycles of IS sec at 94^C, 60 sec at 60- (180 sec gi 72^0 add 20 sec pr cyc^&l 

1 cycle 600 sec at OS^C* 

The three PGR fragments were made and joined In later JOI^•il^4G-FCR reactions, 't he three 
PGR fragments were sangte sharp bands and no gel purification was necessary. Only Qiageo™ 
PGR punfication was performed: prior to the foiiowing JOINi.NG-PCR. 
JOlNliNG of tagmerst 1 3 (same procedure for fragment 2 + 3): 
5 pi Buffer 2 

8 |ji dNTP*s (1 .25 mSVI eaob) 
S.O \il Fragment 3 
6.0 |jl Fragme-nt 1 
9.25 }jl water 

1 cyoleof 120se©ai8<i'a 
Break, .A.d<tEmyme 

10 ejfcfes of 15 s®c at 94 -Cj SO sec at 6D*G, 240 sec at 72% 
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1S csyotes of IS see at W$m #;6Q -CjCIBO see at T2'C SiM W mp pr cycle) 
1 e^qSe 600 sec at 8@'C. 

S After thefstst cycfe at 94*G for 1|0 sfee thf ire Is a bfeaH, where p:7S Ml Hnl^fme miis is a*led< 
Total volume is nm A^.O pi 

Aft©r th© WtfallO eyclas, tfiere is another i^eak |n th<& oplisi^ and far fmgriiont i+l; \i 
C20m^ #179S41) anci a,§;|i! (gO pM#17S163) am added and for fragment ^-^-S: 1,5 (a0;4M 
#i 19^42) arid 2>S {^0 #179154) afi© ^ddecf and tht cyding is coifiSnusdl^^ 15 cyclfs 
10 more. 

Th® PGR produs^s were ten gel pyrlfied; The ■me of fragment 1-^3 should be 3.4 isb 
snd sfee of fragment 2'*'3 shouM fee 3.4 kb. th^se two frsgrftgnts were joined to a iasi PGR 
rigiac^sn (Es^arid^'* long system, Rodie): 

IS. 

14 lil dNTPs (1.25 mMeadh) 
S.0 Fragment 1*3 
S.0 Ml Fragmenl 2-^3 
£6 17.76 pi water 

After Use first cycle at 04*0 for 120 see there is a braai<, vviwe 0.7S \ii Bm^im miK Is addsd, 
Tctal voiums is now 46.0 uL 

After the tnstiaS 1 0 cycles, there is arsothe? bmak In the cyclsng srsd 2.5 pi (20iiM #1 79S41| 'm<i 
as 3.6 (20 #1 79642) sdded md the cycBng is contlnusd for 1 S cycles niore. 

1 cycfeQf 120secat84'C. 
8r«ak, .Add Emtyme 

10 cycles of 15 soc at 94' C, 60 sec at 60 C, 240 sec at68"C- 
Break. Add Primers 

3d is c>'cles of IS seo at Mp, 60 sec M 60'Q/18iO sgc atSS^^^ pr cycle 

1 cycfe 600 s^s at 68*C> 

The sfee sf the lslrsed PCRfragmsht is 6:8 kb. This PCR fragraeM was purified usirig a 
Qiagen'^ PCE puHficatispn kit, snd $ pi of the 00 eiuted DNA was us^ to transform a 
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standard as«MI$ strain. Mer ti^fisfbrmatbn ealis W6*b ^prs^d oMo LSP0-12Opgftft^; of 
Bp&Q{\mmydn. Ns^ day mom thm 1000 poSoniss 'mm ^®m. 8 of thess wsm che>(^®d using 
PCR priEKtm from last JO!N!NC§ PGR ampPeattpn yfeldsng P0R t^grtisnt of 8.§ i# r#mr than 
th© S.S kb ©xpecfed if deieSlon had not ocmirrBd, Fus^hsimom, tjhe pepfMeiyas# actlv% of the 
5 Gloms wa$ slieoked wttli thi^ Manaiil ImmuiRoassay, whfoH sJipwed f?d fi@actMt^ towards IN 
pestateiyase activity. Tili^?$ taken together vsith Sis Spec n^lstance tells as: that ddetion bad 
oceurred. On^ ^ch cldsla was selesSsd ariti <i^oted ^SiOfS, 

InserKen of BP^ I' ^xgrnssiog i cas^iett^ ad jacent to the ges^soe e-fes in y B1M3 
10 The iigatbft mik of the digested PGR amplified tdipie jsrornoter BPH m^rmmm 

c^0B^B and the Kpnl-Sa! digested Raci-Spec J^R fr^m^it tennpifte lo a PGR 

ampMeatloh using the PGR primers #179541 arjd i1T9S42, this resulted 1ft a PdR ^gmer?t of 
approx. § kb, Milch was used to trahsfdirrt RLIOOl (Old0fich;^n, B at al 1980. Cterjing 

df aldBi which ^nc(xJfeS alpharacsstQiactate desarboxyfase, an exaehtym fhsm Bacilius fere^^S; 
lg J, Sactenol, 172, 431S^321:): competent cejis. The tr^hsfdrni^ <^!is were pteted on LB-t20 
|jg/rni SpectinomyGin agar plates with ^im hislk. Spsstlhornycin resistaht coioniss with large 
skttn rpMk ctearing zones were res&«aked on Spectinomydn agar plates and snafys&d for the- 
ihtegratfon of the PGR fragment with PGR using ^ primers #179541 (SEQ ID NO: 13) and 
#179542 (SEQ ID NO: 14). 
20 Appearance of a 9 kb ti-agment indicates that the PGR fragment has been integrated 

into tl^e Snost cell genome. Severa; of these clones were sequenced to confirm intiegra^on of the 
8>cprSssion csssete, one such cione was selected and denoted PL3SS8~37. 

Example 4, Constr^jctson of piasmid-tjome ciiromosoma? sntegratlonal cassott© 
25 An E.coil piasmSd-foome fntegrettonal cass-sUe for a iibrasy may be constructed in vivo. 

An iiitegration cassette to be used according to the method of the inventiors may be present on a 
EmoU piasmid {which is capable only of repiicaticn \n.E.€cdi\ not m B.subWis), ti>e piasmtd 
compflsing; 

{) The ONA ses|uense encoding the Pre-PfOH:iomalhs of the sulstiilsin protease 
30 cjohnmcnSy kn^wn m Sa^iRhase, preceded hy and op^biy linked p 

11) a DNA seciaence cohiprising a mRNA stafelii^lng segment defived in this particuiar 
case fros^ the Cryltla g-sne; 

ill) s markor gene (a dilofamphahicdl mslstenca and 
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iv) gsnomie DMA from BmSim mtftlM as 5- ^nd 3' flanking segrnenlc The 
homoiogous 5' poSynuclecstlde region ypslpearn of the polynucteatfde |yfmD~ytmC~>,'fmB--yfmA~ 
PBl-startj, and tb& S pofynudeoilde region dc>wnstr«am of the polymiejeoticl# [PeNn«l-yfl^- 
eitS(8taft)], respectively. 

S lh& casss^Sta was ma<J« &y siv^rai cloning stejss involving dsgestSon of ptlGI § pSasmId 

m4 PCB fragments with appmprf at® restnotiton ^ndsnucfesssi sites of sevsmi diWemnt FOR 
fr^gmisnts m the geriieraliy used plasttiid pyGi 9v iigsfeii 0 & fragment into a 

piasmid, the ligation ms3<ture was transformsd M<i ©feietroconip^tent DHSialpha csils that 
w^ro pmparBd for and transfonned electfopoptior? using a Serse piiiser"^ etectroporalor tp-? 
10 BIQ-RAD as dssoribed by the supplier. One final piasmid cons^sS: v\fas confsmied by 
ssqueneing to s^ntslh the cprract construct as outiined a&o\fe, and t wa^ denotod pMB1508, 

The pMB1S08 plasmid thus contains the 

i) TM CiylilA mRHA stabiiisihg iead^ sequence ineitjding a ribosomef binding 
ig sequence (R8S). operatlonsaliy iinksdio 

11) encoding thsPra-Prc^dPhiams of ih^s^^^^ as Si^vln^se, 

induding kpnl and Hoti sites for cloning; 

.81) The chloramphenicol r^sistanc® Op^h; 

iV) The 3' downstr«ar?> flanking region [P#<end-i^SrCitS(start)3 which is S9~100% 
20 HornoSogcus to the region of the B^mMffs. 

The four elements listed were doned sn the pUC'IS vector (isolated from £col/ ATCC 
37264; Vieirg J> Messing J, Ths pUC plasmidS; an yi3mp7-der!VBd system for inse.ftion 
mutagenests and sequencing with syrsthstfc universal pf"in"5e!"s. Gene 18; 1982,) .in the 

2S EcoRI and Sail sites to give pMB1508, In order for the resulting plasmid to integrate effeciently 
to a specified site of th B.S'JttlUs genome, a new strain was established. The new strain is a 
de.nyatlve of Bacillus subtiiis 168 BGSC accs^sion oursiber; 1A1 168 tfpC2 . The strain was 
mads competent and transforn^eci as described above. Using elements from the PL3SS8-37 
clone described above, the new intsgratlon strain denoted MB1510 was established and 

30 characterised to contain: the Ibilowing al^iiehfe tOfn PLSS98»^3?: 
I) TIb tdpt© promoter and the mRftt Mafefllsing element 

il) Flanking segs-nents CQmprlsIng the foltewlng homologous polynucleotide region 
|yfmD-ytmC->fmB-y1biA~i9fel-siS!tJ upstrete Of IHte tdpie-profndter, and tha poiyfjucieotide region 
IPeNnd-yfiS-cit^Cstariili downstfeam <^ 



Thus, wben using MB1§10 competent ceiis, it is possite fer the iSMBISOB (or 
deriVatlv'ss thisrBOl^ to ^Jifect)^ sntsgrats Ifito Ihe geno?-ne of wh^r© th© fiankirlg 

r^ions In teion wlll^ ^® t^'pl«^^-pmm0t^ and ml=iJNA Mabflisirsg ^le^ti^nt is located, resySing m s 
§ c©n$trMCllofii whi^m thelrwomihg PrePm pl^A of pM^ISOBDNA has been pMgmtied ln 

the cprrei^ reading frame with Ihe tnp^-prc^oter, the mRNA stabilising etement snd the BBS, 
Thus re^ylting In liigh esspresslon of tiie iMegraled §ene from t^s prornot^r elemsiiis aimidy 
present on tfie genoms of MB1B10. 

Trsil^csrmBtloii effeclendy ms estltblishedl for ti-ss B.m&iM strMo yBISIO 
10 irmsfomxBi^ mHh E.coll prepared pia^liS pyBISOS. For fgither te^sng of ti^^e pofential of umt^ 
apprdael^, th^ Savina;^ ^needing igene of BstaiSus ci&usslwm^CH sfRpHfisd psifiig ihe iwo 



the resulting ~0,8 kb bf the Sm<1hiase Ifagrnent ar^d ifm p?s^B1508 plasfnid are 
digested with i<pni and Ho% and the resulting frsgmsnts sre then purifiied by agarose gel 
electropi^Qfesis, The two fragments are iigate<.l and the ligation rnixture is used to transferm 

2.0 cof?"spster!t E. con ceils wi^ich are then plated on lEJ-agar plat&s or piaced in liquid media fer 
gro\%th overnsJght at 37*C; both types of msdia containing 50-100|.jg/ml of Ampicifiin. Alter 
Incubation, a piasmid prep is made of the liquid culture. The purified plasmld is used for 
transformation of competent ceils of M81S10 {using 1DO--10.00D of piasmid per 
tfansformation. The transformed ceils are plated onto TY medium with .2% skinimsil^ and 8 pgiml 

M of chioramphenicol for selection. After overnight incubation at 37*C cieanng zones appear 
around those colonies wherein the Integration cassette is intesgrated properly into the cslis, 
indicating high Savinase e.xpression. 

This approach can also be used to make highly diverse tihranes of any gene of interest 
expressabfe in B,ss.iMIM, wber^ mthav tjan a gsn^ ertcpding one «nzyme> any expressahle 

3S p{;^r?ueleotlde Is m®®?ted Ir^tp the piaiSEmf<i jsiyiBlSOS ^nd biegj^ted Into the ftrain fPr 

subsequent sisrsening. 
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Primer #317 (SEQ ID NO; i^tggcgesatcggtaiscMgggg 

Primer Ndtl |SEQ ID HO" 20) oatgtQ^atg^gee^c^ttaaqgeg^ccgc^ctg^ 




Th^i^asmM p&fBtSOS has the followir^g components, Indlis^tsd by basepir posltfbrts: 
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BP 5186-39$; pUGIS sequenee from Ecoj? oSone ATGC 372S4, Vmm J, Messing J. 
The pUC plasniids, an MiSmpT-deRved sj^tsm for ms^rticm mutegenesb aiii^ $®qu©nci!f!g wsth 
splK^cumvars^tprimers, G©r^ei@; 250^268, 1882. 

BP S©6-i02i; EgoR i dofiing sits (8P3St6«401) apd ths CrylllA mR^JA stabiSfssng 

EP 1022«-1412; Enoades iNe Ffe~Pro s@S|yence of Savisifass and the A*i>;S cter^ifs^ sile, 
(Pfs-Pro part d^isafe^ tr? eg. WO 96230I3-A tie site and; the spring feetwesrs th© Pr0- 
Pro and Na§ was Iritrcsdueed by the PCR prlma-i 

BP 1413-2512: Th&Bgi II cl<>nlng site {SP1413-i4i8) and thi^ Chlorampheinbof ae«tyl- 
10 ^^nsfefas^ jspef^h of pDNIOSO (Dssaibsd Irs eg. Oldencfen.B,; Poylsan,G>B>; 
Joerg «ins$n,SX; A us^efa! eionfng vector fsr Badilus syfetilis. PliasmM 3D;31>2 (1S93)). 

BP 2S13^18S; The pplynucieotide re^toff pej^hdr«yBS*^cftS(start)] doW^ 
pelB locus of tj© ^enonia (as It app^aar^^ f rosin the iHibllca^^ and coTOspOiidifig 

database csf; R Kufist, N., Ogasawata, 1, Mosaer, "^146 other aathoifs>, H. yoshikawa, A. 
15 Danchin. '*The oom^iete ge^o^v^^ sequence of Oiiatjvposiisve becte^'um Bacslus subtiiSs" 

N^tum {1997) aso^a^s-asg). 

mi 51 0 has the following spscirtc festtires in md mami the P®1S IJjeu*: 
20 I) The tslplf promotsr aiid the rhRNA stabllislbg ei^iefit indlyd a l?:BS ^Ribosome bsndi.ng 

11) Ranking segments comprising the following homotogous polynuclsoiide region ^ftnD~ytmC~ 
yfm8~yrnv\~Pe-!-sta?t| usstrearn of the triple-promoter, and the polynucleotide region [Pel-sncl- 
ytlS"Clt$(st^rt)| dpvviistream of the mRNAstabslzing sequence. 

25 

^SMm8t.Mt1.§1.fi.8§nomlc.inmr#tlon s-ecjion (SEQ ID NO: 22) 

BP 1-2873: corresponds to sequence of Badllus subtliis genome yfmD-ytmC-yfmB- 
yfmA-P©l-slsrl (as R sppsaars f?x)m the pubHcation and cm&Bpomlnq database of; F, Kunst e! 
al 'The comptete genosT^e sequef tCs of tlw Grsm-positlve bsGterium Bscillus subtills" 
30 H&tum (1 997) 380:249-256), 

BP 3102-4082: The triple pmmdter amd CrylllA mRNA slabillsirig Piemen! jplus ESS. 
(Described above In PL3598-37 construct). 

BP 4QS3'57tSj The poiynuclebtlde rBglon [Pel-end-yflS-dtS(sta)-t)| end of and 
doWJStresf^ of the pelB locus of the B.sub&tB §mom^ (sts It appeaas^ ffom the publicaion and 

m 



sorrsspohclsng database of: E Kunst, U. Ogasswara, I Moszsr, <148 qther authors>, M. 
YosS3^awa, A> Danchfn. '"^hf eomplets gefsom® ssquenc© &f ikB Qtmi--pmMv& hst^Mfkm 
BrnWuB sufeftis" Nature (1 m?) 390!249~25S). 

This example shows tie conM^^ tail-variiant library. In Ms liferai^ Mo amino 

acids \sf#ra tJitrsduc^ at th^ G^ternfanaJ of tie 10R pfptelh, Siieh a TaiMfery ma^ ^se made with 
the method dasc^te^sd absvfe osi^ PCR primers in a PCR raadlon using genemfft 



wherein N » A, €> 0 sr T; and K. « T or <3, 

the rfe^Uitmg PGR prcsduct was ^tp^sted with EcoH \ snd Sg/ Ji and llgated JMo EooR ! 
aod Bgff il digested pMBiSOS. HereafteFfoliowing the principle described above, 

QHIarampt^enicoi resistant B&clfhm subtllh iransformants pK'jked by a robotic 
20 dolorsy picker from a bsoassay p^ste and {ransferred into a 384 weli microtiter piate (iWP) 
containing O.OS X JY supplemented with 6 mg/i ciiiorampiienlcoi (50MiAvell), Tlie IVITPs w®f© 
sfioubated at 26'C for 7Zn. After incubation aach weii anaiyzed for pj-oieolytie activity. 



2§ for det>3nninatson of tiie tx-vo taii amino acids in each ts'ansformant by DNA sequencing, the 
sequanc-ing resuits are summades m tabie 4 and taye 



10 



(SHQ lb NO; P); 05«^^(soagtga;attb0igt^aaagtgc5 
1606 (SEQ ID NO: 24): ceagatctctaisiktnki^tacggagtctaactc^ccsa^^ 



The tiiirty Badthi$ suttiiis transforn^ants with highest proteolytic activity wer-s selected 




m 



IQ 


2 1 


QP 


2 




2 1 


IT 
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TQ 




!T 




QQ 








Total 


30 1 



TabSe 4; column one shows the mmto acid ssquenos of the tail, and cakimn two $hows the 
number of BacSIus sutMs traiisformants sequenced with that particular M tall sequence. 
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Tabi© 5: The tab!® siiows the amino adci wMch could be Introdyped by the primer ueecl for the 
lil^sry coftsiiyiict and the actualfindlngs by^ DMA seqtiendng of ths thir^ sobnie^ isoiated fnom 

to ExasBple 6. Construction of BacUius mMimMraim L2, L2 HVO, antl L2 HV1 

A BacMas Buttifis strain was made lanaiogousiy with the constructiQR of the BscHkm 
subtiiis strain 10RS, with the DNA :C<xJifjg for the pro-form of the S2A protease from 
Ngmrdiopms dmsomilfm BisbBp. DasaotivifM QSM 4323S, demoted 12* fiised by PGR In frame 

S7 



M tii^X>MA coding for tHe signal peptide fi&m SAViNASE"*^** {a ts^lhknms?n commefoM pfc^^ss 
derived irom Badffus Qk^mff, av^ulaNe from N^yox^mes, Denmartc), the r^sMltsng slralrs 
demoted BmMm mb&is Say-L2v 

Tb© PNA s^u^nc© indtidlng ih& qqdhg rsgion for ths pro-^n^tum $2A proteas® ir«m 
S Nm0f(Mop&f$ a^issonvillm subsp. DstBsonviM DSM 4323|, s# sntpilfl^d vsfltil^^ prte^^ 1423 and 
147S, IS sh<Mn In SEQ ID NO; The sJorfBspondlng gncsded pro-^fm amino aad secjuencs 
for the 12 protease is shdwn m SEQ ID NO: 28, 

147S {SEQ1D HO: 27): ^gagcgsafgaacatgepSaggt!?^ 

10 

Tv?a talf-varia^ils of this eonstect were also mad^, T^l variant Say~jL2 NVG wm 
constructesi to have 8 amino acids exfe-a In the C4erminue; QSHVSSAP |SSQ iD NO: %% by 
usisig the OHk ms$mm4 eKtehsion inserted in front of the TM sfopco^^ which i;| sbo%%i m 
S£0 ID IslO; 4. Tail variant Sav-L2 HVi was eonslructed to have 4 amltio adds exts^ in the 
lg tem^ihu^: SSAP* (SEO ii^ NOi S), by U^lng th& DN^ seqy^fibe sxten^idn in front Qf t^se 

TAA stopcodors which is shown in SEQ ib NO; 6. IBoth tail vaflante had the SAVlMASgf ^ slgnal- 
peptlds erjcodlng sequence ftiised in friame wjt!^ tl?e pfo^rr^um anjeoding sequeaoe, just like in 
Sav-~12. 

Tns Sav-L2 gens and the two taif-variarjts Sav-LS HVO srsd SaV'L2 HV1 ¥/ere 
SO; integrated by namologiius recombination on the Baca'ys sabtms MBIDSS host ceU gsnoms asi 
qytilned above. Chloranipbenicoi resistant trsnsfcrmants were chscked for protease sc|iv% on 
1% skint mijk LB-PG egar piates (suppfementeci with 6 Mg/mi chloramphenicol). Some protease 
positive colonies were further analyzed by DNA ssqiiencsng of the insert to confirm the correct 
DNA sequence, end one strain for each constaict was seiected and denoted B.sutMs Sav~t2, 
as B.subtiiis S3V-12 iiVO, and Sa¥-L2 respectively. 

Example 7, Fermentation yields of the B0Ciikis strains of example 8. 

The three B, subiiiis strains of example S, were fermentsid on a rotary shaking table in 
S0O mi baffied Erbnmey®r flasks cohtafninf 100 mi TY wppsernanted 5s?lth S 
3d dilijrgmfjtssnlcoiv Six EHenmsy*^ lasks for sash ef ilie three B, subW^s strains were fermented in 
paraitel. Tvm of tha six Eriennis^er fiaske w 37*G (259 rpm), hvo at (260 

rpm), and the last tx's^o at 26''C (250 rpm). A sampie was liken frdm each shaka flask at day 12 
and 3 and anslped fsr prcteolyttc activity. The results are shown in tables 6-8, A^ it cm M mm 
from tebids 6-P» the s^eet of the tails also itrnMsei the exp-esslpp ley#! for ti^e Siy-L2 prplsase 
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from Nixmmmpam dammvfMi mb$p, O^^mviM DSy 43235 wh^en expm^ssd In 8. mbtiUs, 
An Increase cjf up to 40% % observed InM^ expeHment, byiowr^fl improvemenife obssfve<l for 
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The DHA sequence osdislg for the prd~regfoh from the 12 protease from NocardiDpsis 
fhs&onviM suijsp, D^smmtim, DSM 43235 ts ^now ifi SEQ jD NO; 29, an^^ the; corresponding 
amino acid ssqiisn&e is showri in SE^ ID I^G; 30, A Baciflus sub&s^mn denoted UiiOR, 
Similar to BaaiifH sut^Sk strairt lORS, but With the p?IA boding for the pro-regiofs of tlie H 
repiadng the p?t>m§iOfi of lORS, vs?as made. The en&re L21 OR protease encoding! sec|y@nc» 
Ind. tee pro-r^glon of 12, is shown if? BfeQ iP ^10; 31 . 

Two tai! vMmtB of the afesove otsnstruct ssfefe siso m&de. tail variant H\/0 was 
constructed to have 8 amino adds extra In ths C4ermtnus: QSHVQS,AP (S£Q ID NO; 3) with tl^e 
20 DMA shown- In SEQ © UO: 4 inserted sn front of the TAA stopcodon of the encoding sequence. 
Tail variant HV1 was constructed to iiave 4 ammo acids eortra in tt5« C-tenrsinus; QSAP (SEQ ID 
NO: 6) with the DM.A sequence shown in SEQ ID NO; 6 inserted In frdrit Of tlie TAA stopcodonf of 
the enooding sequence. 

The 10RL2) construct and the two tall variants were integrated by hornologous 
2:5 recornbsnation on the Sacf/fes subfih's MBfQSB host cell genome. Chlorampirenicol resistant 
transforrifiarits were discked fsr prQt;eas© activity on 1% ai^im m!li<; LB~PG agar pLgfes 



(supplemented Witt} 6 Mg/s'«! chloramplwhicol). Some pmt^se posjtiv© csiortiss wwf feithsr 
analyzed by DNA sequencing of tile Insert fe^c^ correct DNA sequence, ansii a sNlf^ for 

& 

Example 9. f^^mienta^iiprs yields of 10R tsii-varbnts with hetsmbgous pm-regton 

jhB mB. BuMm spm^ i(m, iors mm imm hvi, mm i^iqb nm, md 

121 QR HVIj W0m fetrmnWd m a rotary shaksnf t^bi© in SOO m! baffled Erl^nmeyBr flasfes 
corstainliig 10S ml W suis^terneiiited wHh S-mg/! chloramphenicol/ SkEi^^ foresci? 

10 of ihe B, sybtliils strains wo?© f©ftn^ted in pafsH^. Two of $fic Sfl^moyef fissfes vsifero 
snoubaf^^ a?^ i2m rj:mi), two 30*0 (S^O r|sm), m4 tS^ iast t^j at 26*^0 (SSO rpm|, A 
ssmpie was taken from each shske HiSisk at day 1 ^ 2 and 3 siid analyzed for pratsoSytic aGtivsiy. 
Th^ jhesufe arife shown in figure 1, and in tabl^ 9-11, As it Q$ti tje seen frorri the rasuits, tha 
affect of the exohange of tlio proregion from 10R with the ■ prorSigion frofn the 12 pr<»tmm 

i6 resuitesd m a surpHsingly high improvement on the expression tevei of the 10R protease as 
itjeasured by proteoiytle Activity in thife duiture broth at 37'C, The effect is mosft proflouhded In 
the two tal! variants, 



Tabie J, Relative proteolytic activities at 3?'C. 
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Tablfe ll. Re!a8ve pmteoiyttc actl'i?fSas 3t 2e''C, 
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EjCiSmijIe 10- Repast of sxampS&s 1~§ wjth ottier 10R-llke proteases 

Gomptestely analogously with the above ©xamples 1 through 9, sM^f^xpsnrserit^ ^re 
canied out with the proteases of the foltowsng NocarK^iopsfs strains; 
B (a) NocanJiopsis dassofwilhi NRRL 1 81 33 as described in WO 88/03947; 

(b) Alocard/opsfe sp- NRRL 1S262 as described In WO 88,''03947, the DHA and amino aaid 
sequsncss of the prote^ssa derived from Nocardiopsis $p. NRRL 18262 are shown in DK 
patent application no, 1996 00D13: and WO 01/58276 deschbes the use m animal feed of 
acid-stable proteases mlated to the protease derived from Nocw^ppm m- NRRL no, 

10 18262: 

(c) Nocardiopsis J^ba DSM 15647; the amino acid sequence of the protease is SEO ID NO; 33, 
the encoding nucleotide sequence is S£Q iD HO: 32; the gene is isotated from the gerjornlc 
DNA of this strain by PCB-aniplifleafiqn usins the tswo prlniers; 

1421 ^SBQ ID NO: 34): 0toatogatcgoetcggctgcgacoggccGcetcc<s3dsgtd 

15 1 604 (SEQ ID NO; 35) : gcggatecf atcagg^cgcggggtosgaeo. 

{€) NocBtiifopsk prasin^ DSM 1S648; the amino : acid seciuehcei of ti^e protease d S£Q ID N6: 
37, the encqdlng fitJdMti<Je sequi^ce Is SEQ iD NO: 35; the ger?e Is Isoiated from the 
gersjfnfe DNAof this^feain by PCR"afnp!ifi<^^oft using th«twfe primers: 
i 346 ^SEQ ID NO: 38}; gf ts^tcgat89P®tegg?;^8ccaccgga<xgctccGcc^ 

20 1 602 (SEQ ID NO: 39): gcgigatcclattaggtcc^gag^cggacgcjcccagg^ , 

(e) Moca/idfcpsfe pm^^ of the jsrotease is SiSO iD NO: 

41, ih® eh0dsng nuci^ottde s<SiqUein(je Is SEQ ID UO: 40; the gene is isdfat^d from the 
gsnomic D?#k of this strain by Pd^-amplifjcalipn iisirsg fee kfo pHms!^: 
1603 {SEQID NO; 42); 0tcatcgategcatcgg%ccsae«?gga<isseldcc?^ 1602 (SEQ ID 

2S NO; 39). 

Examj^e 11 in vivo mm&^Si&iTlc p^tf&rmmm of a jprntease ft&m DBM 4MZ$ 

The perfp|^anc0 of the Nocatdiopsis (Mssomlilm sobspBcies dsssonvsM DSM 43235 
pmtsas© assayed In a rnonogastric //? wlra digestion model. The perfon-nanse of a purified 
30 prsparation of th© rnatum part of the pplease baling SEQ ID NO; 28 {prepared as described 
above) was tested In m M v!to model strriuMsng the digestion in monogastrtc afvimaSs. in 

m 



pait^cular, the protease was tested for sts abSity to Improw soiubiilsatioo and <:lf^esiion of mafeeA 
SBy {mafeeZ-soyfoean mea0 prote!ns> is the tabSes baiow, tt^ protsase isf designated "prptease 

T^e vif0 sy^em consisted of 15 flasks In which mas2#^B&1sul?st mlSalfy 
§ InciUfcafed with HCI/papeIn - siimy!afe"|ig gsstrle d|g#sti<>n - artd isul3$e!«;jU©Fitiy vifith fsaficf^iitK ~ 
slmulair*g Intsstlnal tSlg^tloa 10 of th0i[ask& were dosed wfti t)@ fsroti^se altha stsft of llie 
gastric pasa whereas th« r^alnlng fiesks ^ryad as blanks. At th0 esid of ih& pte^tirsai 
mcubatfon phase sampies of m Wfe dlgssta were r^pved md analysed for SGlubiHsed and 



Table 15: Oiiitiine of i^^^^ ppocedura 
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f sfminSli Frsoubiiort 
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Cojielitioji s 

Siubslrat©; 4 g SBM, 6 g maize (premised) 

15 pH: 3.0 stomach step/ 6.8-7.0 snteatinai sfep 

HCi; 0,105 M for 1 ,S houra (i.e. 30 mirj HCI-subs^ate pmmsKlng) 

pepsin; 300Q U /g diet for- 1 hour 

pancreatin; 8 mg/g dift for 4 hours 

teniperaiure; 40"C. 

20 Replicates; 5 
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8;105 M BOS contsiriing 6000 0 pepsin per S mi 
1 P NaHOQa cSBteifsIng 16 mg psfscreafe per mi 

Er^me srotao detemiiMtioiis 

The ampuni of protease enzyme prolefn f n wHgt foitows, Ensyme iProtefn is afoismvliited 
EP} Wcsii&jM^^ on iim b^^ m& ih^ amino adio! ^wsrsc^s (iimlfio acsid 

10 compf^ifens) tjsirsg ti*e prMpiM mitined in S.O.GH^ & P,H. von Hlp|>^i, AnaSjj^Si^l Biocl?em!sJfy 

Thife expedmentSi! profiedurife 1^ accort^ing ts the abow Qutlirtsi- pH wai^ me^uned at 
is time 1 , 2.6, iiours. fnpyisatfSfhs were temilnatsd after 8 hours and sampbs of 30 mfwsr6 

remsved and placed on \m befer© cehtrifogstion (10000; x g, 10 mlh, A'C), Supematartts wsre 
remssyed and stored ^20*Co 

20 All samples were analysed for % degree of protein mth m OP A msthod as wsilas 

The Degree of Hydrolysis (DH) of protein different samples was detsrsfiined using an 
25 semi-automated microtiter plats based colorimetric method (NielssnPAl; Fetersjen.D,; 
Dambmann,C, Improved f^iethoci for determinincj food protein de-gree of hydrolysis, J. Food Sol. 
2001, 66, 642:-646), Tfie OPA megent wss prepared ss folbws; 7,620 g di-Ma tetraborate 
decahydrate and 200 sodiun^dodecyl sulphale (SDS) were dissolved in 150 ml deionlzsd 
wafer. Ths re@gt?^t% were completely dissolved before coninulng. 100 mg o-phtliaWialdehyde 
3S W% (OPA) wm dissoiysd in 4 m! ethanol. The OFA s^lu^dn was transferred quar^titativeiy to 
the above-mentioned solution by rinsing wte deionised water. 176 ditMothreitoi 09% {OTT) 
was added to the solution t^at was rsade up to 200 wlii tleidnized water. A serirse standard 
fOJSiS nmcjv,^} was preparsd lay sofutlii^hg SO ir^ serine (yerci?, Germany) in 500 m! 
deipnizfed wMer. 

m 



lh& mn0& m^ by dilutsrsg each sampte to an afcisorbsfiss (280 tm) df 

afeQUt 0,5. ©anerally, su|S6rjtatants wem cliluted (100 h) using ao ati|om#®d Tec^n 4lyi»iis 
^atlOf> Cy|nnedorfi Sv^tes?|an?l)> A!^^ w^r® performed at 340 

nm using deiprsKed water as the epntrol 25 pi of samfste^ slan^sr*! and blsnd^ v^^^^^ into 
S a nilcfdtiter plate. The fiiifcrp^iter p|at# wa? Nsfertecl into m lEMS MF reader (talj%st©fli«, 
Fittlaqd) and 200p! of bPA reagsnt was automsticagy dispensed. Plates wares shaken (2 mm; 
TOO rpstj) befoM ffiea^yring afesorJaan^^ Rnaily, tile OH vs^si caioulatefii. Eightfold dsfermlnMiOf^ 
of all sampios was esrri^ out. 

10 S^tjffiMonMMy ^^^^ 

The ccsMent of sslMbillsod protean Iff su0erna|ao|& ^<jm In dlg^ted samples wss 
i^imatsd by quantifying crude protsir? (OP) mmg gB\ Mltmtim HPLO, Sup^rnatantis were 
tliaw^, fiitsr^d through 0,45 pm poiyo^rboisata liters atid diiutad (i;SS, v/y) v^s^tfi HsGJ. Diluted 
sampies wenfe chfdmatPgrsiphecl by HPLO tislng a Superdex Peptide PE (7*5 x 300 mm) gel 

IS filtrate CQiumri (GioMi), The ejuent used for isoprat^o eiution was SO mM sodlurn pihosptiate 
buffer ipH 7,0) cofttsinlng 150 ii)M MssGf. The tdtisi volume of elu^rit per ain was 28 mi arid the 
flow rale was 0.4 mi/rrsln. Eiutlon profiles: were recorcted at 214 nm and the tsstsi area under the 
profiles was determinsd by Integration. To esllrRate protein contef>t from integrated areas, e 
calibration curve was made from a dllutbn series of an ^.'ifro digested reference 

28 mafee/-S8M sample with known total proteir? content. The protein deten^rination in this reference 
sample was carried out using tiie Kieidahi metiwd (determmatipn of % nitrogen; A.O,A>C, (1084) 
OSiciai ysthods of Analysis 14tli ed., Washington DC). 

The Gonteni: of digested psxjtein was sstirs-jsted by fr^tsgrating the dvomatogram ansa 
GOffBSpof^ding io pepiidos and afrsino acids havirsg a molecular mass of 1S00 Dalton or below 

26 (Savoisl.; Gauthiefr,S,F. Dialysis Celt For The In-vitro ysasurement Of Frofein Digestibility, J, 
Food Scl 1S85, 51, 494-498; Babinsaki^^U Van.D,?^.J.M.; Boe?;H,; Den.H,LA. An In-vitro 
Method for Predictior^ of The DIgastlfole Crude Protein Content in Pig Feeds. J. Sci, Food Agr, 
1990> 50, 173-178; Bolsen,S.: Eggurrf,B,0. Criticai Evaiustion ef In-vitro Methods for Estimating 
Digestibility in Slmple^Stomach Anlmafe. Nutmion Research Revises 1991, 4, 141-1 62)< To 

Ss| dMermiiie the i$QQ P^iton dlyidShg fihe, tiie gel filtration colurnp was calshrated Msln^ 
cytochrome C (Boeh^ilger, Gemiany), aprotinlOs gastnr^ 1^ and substar^ce P (Sigma Aldricb, 
USA), as molecular mass stahdardSv 



m 



Th© results shown in TaWes 13 and 14 below indicate that the protease sncrsased the 
□agree of Hydrolysis (pH), as wisill as soiybie and dipstlbte p^ 



Tsj^j^ 1 ^LQ^ifM Hyg^^P^yMS.:(fiN). absolute and relative ya jues 
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Different letters within the sams column Indlcato ssgrsificant differences {1-way ANOVA, Tukey- 
Kramer test, P<0.0§). SD ~ Standa'd ©svfation. %0V « Co@ifide«t of Vadaftce CSD:/mean 
value) X 100% 



Example 12, in vitm aquaculture petformance of 10R4il<.e proteas® fror^i DS^I 4323S 

Psrforrnance of th© protease from Nocardicpsis dassonviMI subsp, dassomill&i DSM 
43235 tr? an aquaculturo hi vitro mociei. Ths protease preparation as dsscnbed in Example 3 
was tested ir? an aquseuliure in viiro model $in-)iilating the digsslson in coidwater fish. The rr? vitro 
system consisted of 15 flasks In which SBy substmte was inlMlly Inoubatad with HCi,/pepsin - 



smiMmg gastric digestion ~ and subsequaniy mih panm^atln ~ simulatiiig intestinal digestion, 
1 0 oft he flasks w ers d osed w ith t he protease a 1 1 hs start o f t he g astrk; p hass whsr eas t hs 
remainuig 5 flasks serx-'ed as blanks. At the end of the intestinal incuisation phase samples of In 
vitro digesta ware amoved si-sd anaiysad for soiubiSised and digested protein. 



Tgt)leJ8: QytjinoMagya^ nrpcedure 







c<sars» 


phas© 


[ 10 g ex^ide^ S8^ SMlsstrats, 
1 6^ ml HCI (0.1SSM)/peps|D 
j (4000 U/g stsbstrate), 1 ml qf 
the protease of iheJnventloB 


3.0 


WO 


t~Oifinln 


0^§Me digestion 


|7mLNaOH (1,1M) 


aa 






Intestlrial digeslidr; 


i S niL NaHCJ©3(1M)/ panensaiin 


: i. 




1=3? hours 


lnt6s6tEiriai digestion 

i 




7M 




i~24 ' 
hours 


; 
; 



Gonditloiis 



10 g extmd^d SB^ 

3,0 sidms<S stfe^ 6,8-7,0 M^sSnai st6?i 
O,1i50Mfer6houm 
4000 U ig ciM for 6 hOMfs 
8 rng/g di^t for IT hours 



to pHi 
HCf: 
Pepsin: 



1,1 MNaOH 

0.1 S5 U mi / pepsin (4000 U/g diet) 
m 1 M NaHCOs containing 1 6 rt^g pancrsatin/mL 
125 mm Hakomneh pH &0 



Exparinsaritai procedure for agyaM^^te jMte^ 
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ThiSi mpBtimmM pitsducei was accordmg to the abws ouain®, pH \??a$ measured ^ tim« 
1,5, 8 aiid 23 hoyrs. Incybsiions were terminated after 24 hoMrs and samples of 30 mL were 

mmoved and stored al -20*C- 

All tupemltanls wsra analysed usiji^ t>0 pPA msthESd (% d^rse of hydrolpss) aod % MJA 

10 Pre-treatm erit # sypamatanife with EA SY SPE eoiumns 

Befpfe «if};g!5^sls 00 AS<TA HPLC supematianiB f mm th« If? v|f«j system ^w^^ 
us^ng soljd-jjhase ssmple purlfatisjn, thte vsjas d^jne to impr<sve the chrornatt^raphy and 
thereby prevent uns^sNe^luiSoh proi!^ ar^d isaste^s^ Ths calymhis f(>M)d^MMm wer^ 
mM phass^xtraiGtlon eolumrss (6hromabdnid EASY SPE GoliaT5h$ from Macherey-Nag©l). 2 ml 

IS rnipQ water wss duted through thesolunlns by use of s vaouum chamber ^yacuum O.ig x 100 
kPa). Subsequently 3 ml 1?? v#tJ sample was dlspfenssd onto the column and elufed (vacuy ro 
0.1 x; i 00 kPa), the first % rnL of elirtsd sampi© was thrown away and a clean tube was pla<^ 
berteath the column, then the rfest of the sample was eiuted and saved for further dilution, 

SO Results 

Tiie resuiis shown m Tables 1 6 and 1 7 below mdioate thai the protease significantly? 
Incrsased Degree of hydFolysis and protein digestibility. 

Ta bie 16: Decir ^e of Hydrol ysis (DH) mea s^urad by ti>e OF¥>. mafoosl absoiut 
25 values 









Of total protein 


i Relative to biank :| 
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1 (mg EP/kg: * 




1 ^ 




SO 


1 %PH 1 %CV 1 


1 Sank 






i , W ; 


0.62 


1 100.0 j ' 2.42 i 


1 Protease ot 


the inventson (Si 


)) 1 5"" 








Different ie^ 


tes-s Within the s 




iin indicate s 


igrsificai 


It differences (1-way ANOVA, 


KraiTier tes 


t, P<Q-OS). SD ^ 


- Standard Deviation. 


%CV ~ 


Coelficisnt of Variance ~ {S 



value));: 100% 



li bjE JI: ■ Soiubised and dls©sted cmde pfotsesn rrseasured byJ^KTA HRLG, absokil^ and 




Dilf8ij"erjt fett^fs Withlrt the same isolufhn ih^^lcate significant diffeffenc€fs (1*-way ANOVA, Tukiiy- 
Kramer testi P<0.0$), SD - Standard DevfalJdn. %Gy ~ CoeWlcient of Vanance ^ ISD/mean 



TWa of th© B- syMi'Ss str^sftis of Exiampie 5, strain 209 mth ths^ amSrio sdd ^di-vanaftt 
1% and stfairi:211 witvth^i feli-vansnt TP. together S;ifiii3iM/sSav-i10RS, were fermenfed on 

10 a rotar/ shaking tabfe In 500 rr?! bafflsd EHenniifey@r fiasks cdntaiiilng 100 ml TV sypplesTiesited 
wtiH S nig/I chloramphsnlcdl Twelve Fxrfenmeyer ffasks for sach of th© three B. subHlis strains 
wsr© f©m"5erted In paraifel. Four of the twsivgs Effeiimeyer ffasks were Incubated at 37"C (2§0 
rprn), four at 30"G (260 rpm), smi ths last four at 2B'C (250 rpm)> A sample was taken from 
eadi shake flask at day 1, 2 end 3 and arfaiyzed tor proteoiytic e©tlvf|y. The resylts are shown In 

is: tables 18 to 20 below. 

As It can be seen from tables bsiow, the effect of the 2 amino add taste Is a 
surprisingly high improvement on the yield of the protease, as measured by activity m the sultijrs 
broth. The effect of the 2 amino acid talis h comparaois to the effect observed for Sav-IORS 
HV1 and Ssv-10RS HV3 In Example 1, 

20. 



Table 18: Relative proteolytic activities «f37*a 
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1,0 
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7,0 
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7,2 


7.7 


4,9 
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4 J 1 3.6 1 4,9 ! 




4,0 1 4,1 f S.O 1 






i I 2 \ % \ 




10 i 10 i 10 1 






|211 


3,7 1 ;4,1 1 jii .1 
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gx^mpj© 14 Synthetic shuf?!«d 10jR«|i!k« protease is^l-varlants with signal 

Anothsr synthetic tali vadant 10R protease encoding gene, denoted G -MAT-22, was 
constascted with a signal peptide, and the 8 amino ac^d C~temimal tail of SEQ ID NO; 3, and 
10 sntrocsuced Into a Saa/fe's subtsHs host for expression. A surprisingly higii ybld of protease v>?8s 
achieved (data not shown). The fuii coding DMA sequence of G-MAT-22 §s showsi in SSQ IP 
NO; 44, and the encoded pre-pro-protease is shown in SEO ID 1^0; 45. The G-mat-22 prDtSass 
Is an alpha-iytic protsase-llks enzyme (peptidase family S1E old notation; S2A), This protease: 
has a higher lemperature optir^^ym {at pH 9) than tJ^e 10B protssas®, as shown in Figure 1 , 

n. 

Example IS: Shuffled Pr€f'Seque«c©s of 10R-like Proteases 

Recombination of prelease g&ms can be made indepe?>dsntly of the spedfte sequence 
of the parents by synthetic shuffling as deseribsd in Nass, I.E, ^t ai 2002 [Natum Bidtsehnology, 
VbL 20 (12). pp. i251'^12SSi 2002], Synthetic oiigofiiudeotids^ degensfat^id: io thefr tMA 

20 seclueiic^ tis provide tl^ posslbity of a8 smlnd add^ fduhd in s# of pMmt prota^s^ ar© 
di^igrsed snd tf^>0 ganas assamfef^ aocoFdlng to tha rsferaftca; TJne ah ulfling qan ba earnad out 
for tfie fMli length aaquahca or for onfy prt of tha sequa^nc^ arid ther? later c©fnblnad #tts the 
rest of the f ene to give a fuN lersgth ^yence, 

Ir? this e^sarnpi^tl^eamliid acid ssqMencefbf tlie Pro-psp^d^ partof th^ pamnt prdtsfses 

2S givers in SEQ ID ND: 28; SEQ SD NO; 33; SEQ ID HQ: 37; SEQ ID HO: 41 ; SEO lb MQ: 43; or 
SEP !0 NO; 4S is ^ncpded by' a ml 0 oilgonuolaoMas asid the re$u!feg sh willed gana 
f ragrhsnts sra corhfoiried irito the cdntext of tha fiiil lastQth pfoteasa gene, which then c»?iststa pf 
dl#i coding for the signal sequence, the (shuffled) Pro-paptide, and In this case the mature 

8S 



proiein of 1 SR protease. Examples of sfiuffisd PFo-p^apficle seq«enc^s sre shosfsm m SEQ ID HO: 
m m^m, W ID MO; 47 (G^2,73). SEQ SD NO: 48 tS^143), SEO fO NO: 49 (m^M, SEO ID 
NO; 50 (<3^2.6J, SEQ 10 NO: S1 (G-2,% SE0 JP NO: 62 SHQ !D I^Or S3 (G-l 2). 

TSe ecsRtf^ste prpteass sBcodlngi genes ws!^ Inserted into genome of i3>y 
5 hsJrrsologpMS FieccsmfeinaiElOfi as d^scrifeed abova^ an?! the pmteases sxpir^sed In sHskeflasks 
using a rich media. Tiie fermsnMlon wa$ carried out for § dap «t 3d°C and fts MjMniatant 
isolated by ©eritrifygatlon |3*^ tHe profess^ M cant^^ B B.$t(MM Gi&m 

BxpMmlnQ '&m wM type protease 1GB fesm Noo^Miopsis $p. MRiRt 10262 frosti an fd^nlical 
qonstfiiction protocel; w^s ferm^tecl un<|ler tie s^r^e <50fKii8pn^, The protease adtMty was 
10 {S^louSat^d and is prasant«d in 8ie table b&\im relatively to the as^M^ cf the wild type 1QR 
protease. Cie^arl^ th© hetarblqgoys prG^regipr^s provids an advantage over the native pro-region 
of the 1 OR protease. 



Rel.acivity 


10R 


1,0 






G-1,4 


1,4 


B^S 


is 




3,4 


G~2;S 


3,0 




4,2 




1,6 



Table 21; Relative activity of 10R protease expressed with heterologous shuiiesl pro-psptides. 

16 Bcampfe i$Jrt vim momjpstdc peffesrmarjfee of tsll^vsdgint lOR^-HVi 

this exas'nf^'e describes a dose/fespohse study with the fowr amirio aciid t^ii variajiS 
HVi of the 10R protease Irv the mondgaslrife in \^ro model ustng Ids 2^ 60^ and lOQ mg EP/kg, 
8f5d using iOR pfoteas®^sbsn«:hmark oNpntro variant lOR-HVIwesoor^nict^ad^t 

have 4 amino aeids sxira In the C^ten^^lhus^dSAiP (SEO ID HO: S) as descriSSi^ above, 

8^0 /f? v#meondifjons: 



Sals^rate: 4 § SSy ^ 8 g make {premlsied) 

pH; %0Momachitep/SJ-7.Qin^$llnastep 

HCI: 0.1 05 M fbr 1 3 hours; (le. 30 mlh HCI-substrate premi)dng) 

Papisin; 3000 U /g dietfor 1 hour 

2 S Panersatin; B ftjg/0 diet for 4 hours. 

Incubated: 4'0°C, 
Replica; 
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Enzymes: 

1QR prolate: FFE-2003-0{mT; PmSMOO; 1S4 mq EP/g product 



Sfcjjution A: 1 0R l CKI BPM ^M; 

(1 EPML * 10 mL)f f S4 EP% prqdM 0.0649 g 
Pmpare 10 ml; Disolye 0.O84S g m^fmm in 10 mL NaAe feuffer. 



CI rrig €P/mL* 20 mL>S 370 mg EP/g pfociyct » a.0S4D^^ 
Prepare 20 tnL: Disdlvfe 0.G541 g enzyme in 20 mL N^c btiifer. 



is Soiytssil D: 10R-HV1, 50ma EP^g diet 

SO mg EP/kg diet '-0,50 mg EP/flask vis 1 ml ~ 0,5D mg EP/ml 

F*mparfe 10 inl: DifUt® C 2 times; S ml soiutioh C + S ml 12S mM NaA«>^b«ffer 

Soiufen E: 10R--HV1 2Sfha EP/kg mt 
20 2S ms EP^kg diet -0;2S mg EP/flssk vta 1 ml « 0.25 mg EP/ml 

^repsm 1^ ml; Dtlute C 4 tlfries: 3 ml solytldn C + 9 ml t2S my NaA©rbuff©r 

Sojution F: 10R~HV1, 1Q mq EP/ka diet 
25 mg EP/kg diet ^0.25 mg EP/flask vsa 1 ml = 0,25 mg EPfm 
25 Prspam 10 mL; Dilute G 1Q times: 1 mi spiutiqn G -^ B mi 1 2S mM MaAc-b>yf fer 

. Substrates: 

Pmmfx (40% SBM / 60% maize), FFS-200arOQf21 

The 1 0 g sample contains S g maize at^d 4 g SBM giving a calouiated protein content of 23.48 % 
3Q of profeir^ 2.3S g^'lask}. 



4m7 m mon.. at~i-oooo2/36 
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52,43 ml af 4.Q0S HCVfiil Wftfi miSO to 2000 mL 

10 HCL2 rHa/pes3Sjn) soSutton: 0.105 M cofvtainina 30000 U pe&$in/5 ml 
Prepare 250 mL 

Take out approx. ISO ml. from the HCi-soiuti-an, add S.18 g pepsin and ill up to 2SG ml with the 
HCI solution, 

15 1 2$ mM NaA c-buffer, J3H S.O: 

Pfepamd from ^ 2 y NaAc^btiffir (KLu 04-07-2003/lsb bdolc 14160 p, 104) 
12,5 ml 2 M Natebisfer, SI! upiQ 200 ml with mliliQ 

20 NaHGOrPsns^estln is p#insde* divided liito portions arid frozsn. Made 29-04-^2003 and frozan, 
it is sifei^y t^av^d in re^gerator met riight The stock preparation is described in lab. book 
141 6S pages QgS. 

as In tie Prernljcing phase 10 g substrate is rnfesesd vviti 41 m! i^Gi1| tilfen in ihs gastnai 

phase (t"30 min) S mi HCh2 (NGi/pepsIn) 1 mi en^^ym^ (or iju!?|r) Is added* and latsf {t»si h) 
Ite pH is mfessur^cl and 16 mi wMer ss added; and th^n In fje srit^stinai phass {t«1h 30 mln} 7 
mi 0,38 NaOH is added, and iater 11^211) S mi NaHCOS/pamreaiin Is added arsd th© pH Is 
rY?#sar©d again twlcs (t«?2h 30 wm & t*9h 30 mir^); sNd finally (t=s=Oh) SO mi sMSp^fislpr? m 

30 sampled lor ssrstfift^gatlon., feactn supema^nt Is immediateiy and oareftiljy removed from the 
Ger^rifuge tMbe Into gi^ss lybp. The sup#n^^ am spiit in po ^stiiqMs for further analysis, 
feuHis ate shown in table 22. 




Ta&te SS: Treatmerjt &f samptes si the meRogasS'fe m vMro mosiBl. 
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/kg diott 


Pepsin 
U/gi diet: 




1-5 


1 ml 
Buffsr 


Blmk 






3Q00 


8,0 


6-10 


1 ml 
SoSulton A 


(FFE-2003-00047) 


3.0 


lOOmg EP 


3000 


8.0 


11 - 15 


1ml 
SolytiofsC 


1SR-HV1 




100 mg EP 


3000 


8.0 


16"" 20 


1 rnl 
Solution D 


10R»HV1 


30 


S& nr^ EP 


3000 






1 ml 


10R-HV1 




25nigEF 


3000 




a,o 




1 ml 
Solution F 


1GR-HV1 
(PP.A22S73) 


3.0 


10 trig EP 


3000 


8.0 



§ Tha changes In levels of solufefe and digesHfoie qrude pret^n in the sdubie phase of 

th® liydroiysates ware d^fen^nin^ using m ^KTA HPLQ (S^erclgsx 30 peptide coiumn). The 
resufts are shown h TaNs 23, 

At a 10R-HVt do?ie of IQO EP/ikg <^fei |h® !@¥©l sjf DsgestsNs prd^n wss sSgftltently 
Increasecl by 9.8%, cornpsred id Biank, The cohtfoi 10R showed a t^eiafiVe improvement of 
1Q ?,?%o, iAHii ih© ioWer ^zyme con<iiBntliatlons (SO, 2S, :asid 19 mg £F*/H§ ^^fet) the relative 
impmv&r«es5its of Dlgestsbte prntesisi vs;8f6 5.7%, 3.3% and 0:7%, re^pecSveiy. 

Ta&le 23: MPLG results with 10R-H\/1 and 1SR showing ths psrcsntual dhang^s In digestiblsCiP 
and solubb €P relative to feiank, pifterent tetfef^ or? top of the fears Indiicate sigmficant 
is dtffef^rices (1 -AMOVA, Tyi<8y > ^5%). 
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The original 10R [100 mg EP/kg diet] Improved the level of soluble protein by about 4'%, 
The effects of 10R-HV1 was slightly higher (5.3% relative increase) and significant- With a dose 
of SQ an»:! 25 mg HP/kg diet tiie relative improvsments were 3,6% and 2,9%, respectively and 
signifioant. With 10 mg EP/kg diet tne reiettve trnpfove,wer!t was 0.5%, 

Decsne e of Hydroiysis: 

The degree of hydroSyele {DH} vsfa$ determmei using the OP,A method. Results are 



shown In Tabie 24. 





Of total proteirt 
%DH m 


Relative t( 


a blmk 
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100,0 
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liOR rFPE-2O0MQG47yi1QCr 
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77 iQ 
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106,0 
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0,36 ^ 107,7 




1,29 


IIORjy |SQ| 


J. 


27,34 




0,57 


105,6 




2,08 


l10:Rwi|2S| 


s 


26,42 






102,0 




2,16 




1, 








88,6 




3,76 



T8fe!0 24; pegr^ of Hydrolysis PH) cjeleifTtiitried fey OpA method; Absofufe as well m 
rdativ® Sifaliies am shown, differem iettefs indicate ^^gnltorit dlf^rerices (l-way ANDVA^ Tutey 

Tall-vf!isj>t iOB-HVI lnif>mye<^ DH Isy 7,?%, compared to Bl^nlc, With th^ lowef cipsas 
{60 a«d 2S m§ EP/kg <llet) of tie fsrofease the imisrovemenfts ranged from 5.S - 2,0%, 
F^pecti^ely, iti Srse with previous findings. At tlie Ibwefet <|os© 110 ifng EF/Kg diet^ ns effept wa^ 
seen. Tl^e oHginal 10R |iOO mg EP,^g dietl showed impfovemejits of 5% f#IMve to Blank, 
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The resyits af thf HLFC AKTA anaiysis the DH dstermlnstes dearly sHm^ifet 
Bsldillon ctf th« fouF amino acid (SEO ID NO: S) long tail to 10B do^ fsot affect tha perfefpanca 
of til© 10R j}rota^$t to ani' slgnificsrst e)st©nt. 
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1. A seomted mature polypeptide wNsfSh after ^m^^ has prqteas© activity, whsch polypeptsd© 
when expre$secl arid ijefore m^ttraSon comf3ns^& a hg^erplopous pro-regton, ancS wHict* 
polypeptide; 

S (a) compds^s ah amffid add ssc^etlce v!i?h!ch h st lM^ 70% Meimtlcal to S^s aniipcs mM 

seeiusfic© of the mature part of the pdypsptfde shown in SECi ID MO: 28; SEQ 3D 
MO: 33; SSO © M0; B7; SBQ iD !SfO; 41; SgQ iD N0:4a; or $EQB M0; 4Sj 
(b) Mmpnses afi amir?© acitl sequence wNdh i§ gtt]es$t70% Idsi^Scal to th© amfeo add 
seque!ic;0 of the mattife part bf the polypeptide encoded |>y the ppiyfiMdeafide In 
10 SEQ ID NO: 1; SEQ ID NO: 2; SEQ ID NO; 25; SEQ ID NO; 31v SEQ iO NO: 32; 

SEO ID NO; 36; SEQ ID NO; 40: pr SEQ IP NO: 44; 
(e) comprises a mature p^rt which is a Varlaht of the mature part of tile pdlyjseptids! 
having tie amirjo acid sequence of S£S ID NO; 28; SEQ ID NO; 53; SEQ ID MO; 37; 
SEO ID NO; 41; SEQ ID NO; 43j dr SEQ ID NO: 45, the s^ment comprising a 
i s substltutbn, deletion, extension, m&or In^lpn of one or more amino acids; 

(d) is an allele mtimi 6ftn% M or (c); 
{# Is afragment of (a), (fe), {<?), or |d), 

2, The poiijf^ptfde acofrding to daim 1 , wHsreln the hetsroioigoMS pro-region is- derived 
26 tern a protease, preferably th© pro-regbn is derived from an S2A or SIE protease, and rnost 
preferabiy jtlsat least 7Q% idenScal to the pm-mgion shown in SEQ ID NO: 28, SEO ID NO: 30, 
SEQ ID NO: 33, SEQ ID NO: 37, SEO ID HQ: 41, SEQ ID NO; 43, SEQ ID NO: 45. SEQ ID MO: 
47, SEQ ID NO: 48. SEO ID HO: 49. SEQ ID IslO: ^ SEO ID NQ: 51* SEO ID NO; S2, SEQ ID 

mm. 

26 

3, The polypeptide according to claim 1 or 2, which is a wildtype polypeptide, en aillficigi variant 
of a wilo!t^'pe poiypeptide said variant iiaving one or nii-^re amino~add(s) added to the C-tem'ifnus, 
as compared to the wlidlype, a siluffied poiypeptide, or a proteitvengineered poiypeptide. 

m 4. The poiypeptide acdofdlhg to any pf dalms 1 ~ % mm eomprises al ie^ three non-polar or 
uncharged polar amino acids within the last toar ambo aelds of the O^terminua of the 

polypep^de 
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5. Th® poiypeptkl® according to clasm 3, vsherein tile one of motB added amino ficld(s) is (ars) 
oof^rpolaF «r oncbargsd. 

g. The pQlypsplitfe accorrfmg to daim whsmb t?a 0^0 errjsioreadd^d amino aeidCs) Is 00=© or 
s rnom of Q, % V, A* or P. 

7, The polypefjtid^ aecoitljng to dmm 3, yieirein tie an© or imi)f:® ^dded sfmifie adds ap 
selected from the group comisthig of; QSHVOSAP, QSAP, ap;TL, TT. QU LP, Tl, ta, QP, 

IS 

8, Tha polyp0pt3da aecordfng to ar^ of alaims 1 ^ T wbfeh wh^fs expmssed ^nd before 
maturatioh comprise a hat^dogaus seo-etiori slgnal^p^ptid& which b cleaved from the 
polj^ppMde When th© poiypeplid© Is ^^eted, prafer^^bly tfje hai#dbgGu& ^cmtion signal 
pejptids Is dsrived frdm a hetemfogbuis protease. 

IS 

9, The polypeptid© accdrdlng td daim Bs whemin the heterologous sseretisn signal peptide 
csfTjprsses an amino aoid sequenoe having a ssq; ranee identity of 3! least 70% unth tha amino 
acid secfuehos encoded by polynudeotides 1 - 81 of SEQ 10 HO: 2( or SEQ ID MO: 44. 

20 1 0. Arj isolated polynucleotide encoding a polypeptide as defeed In any of claims 1-9, 

11, A rscomblnant expression vector or polynucleotide construct comprising a polynucleotide as 
defined isidalm 10, 

26 12, A rscombinanl host ceil comprising a polynucleotide as dsfeed in ciaim 1 0, csr an expression 
vector or polynucleotide construct as defined in ciaim 1 1 . 

13, Thss recombinant host eel! accotxiing to cistm 12 vvWch is a Sac/te cell. 

30 14, A tirarisgenfe piBtt^ or plant pari, corrspflsing a pofynudi^dide as defined In dalni^ or ^rt 
e)i|>res»n vector or polynudieotfde iconstirud a^ defined In claim 1 1. 



7? 



1 5. Atransgenic, rjoft-human ammal. or produois, or eteents Iheneof, comprsssng a 
pofyntJsleoticje as dsined In fiSaim 10, or an expression vector or polpMC|«otl<ie oonsfejct as 
defir^lndaimli, 

S IS. $ ifttsth# for jprodaeing a in any of ;<?Nim$ 1 » 9, fv)eth(3>d 

cs^mprisirsg; la) osifclnga rwombinantho^^ claim 12 or 13, sr @ tmr^sgenlo 

pi^tii or ^fismal as cJi^fih^ sn c!afm$ 14 sr 1 S, tp prq<SM<?e ^ $upeffiatant isdnii^nslns the 
pQiypaptfsle, and optlons^iy <b) reoovermg the polypepticte. 

1Q i f. An mm&\ feed addlfiv© ostiiprising at Isasl om polypeptide as defined in any of Glaims 1-9; 

(M) at least on® fat-soluble vitamin, an^J/or 
(I?) at; le^st one water-soluble vitamlhj and/of 
(c) at l^st one trace mineral 

is 

18; An animal feed composition ftsving-a c^udfe protein content of 50 to 8CK3 gA^g and comprising 
at least one polypeptide as dsfiheid lh - S, or at least one feed sddilive of eiaim 

17. 

'M: 19, A composition comprising at feast one poiyjieptide ss definsd in any of daims 1 - 9^ together 
with at least one ottisr enzyme selected from amongst phytase (EC 3,1.3.8 or S'1>3-2e):; 
xyianase (EG 3,2.1,8); galactansss (EC 3.2,1.39); aiphs-gaigctosidsss (EC 3.2.1.22); protease 
(EC 3.4,-.-), phosphollpaseM (EC 3,1.132): phospholipase A2 (EC 3,1.1.4); (ysophospbolipase 
(EC 3.1.1. S): phospholipase C (3.1,4.3); phos^olipase D (EC 3.1.4.4); and/or Ijela-glucanase 

25 (EC 3,2.1.4 or EC a.2.16). 

20- A method for using at least one polypeptide as defined in any of claims 1 - 8, for improving 
the nutritiona! value of m animal feed, for Inaeasing diges^hl© snd/or solubls protein in animal 
diets, for irsoreasing the degree of hydrofysle of proteins in animal diets, arxf/er for tiiOi trea^ent 
39 of v|getaljte proteirss, the m^dd comprising incfuding the polypept!ds(s) in |himai feed, and/or 
'm a composition for use in anima! figed. 

21. A method for tishg at least one polypeptide as deined In any of claims i 9, oompfMrjg 
ineliidiiig the pdiypeptideCs) in a determent formulation. 
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10495 . 2Cj4-~W0, ST2S .txt 
SgQUENGE LIST.t^^G 

<:iiO> Lassen, sorer? Flensted 

<12Q> Xmpmmd fusion prsteases m4 fsethods for prndycing thMs 
<130> i<54S5,264-WO 

<17Q^ P&tmtzn version 3,3 

<m^ 1 

<2ii> 1062 

<212> DNA 

<21$> Nocar^fopsis sp, HSRL iS2S2 

<22:i> sc....f eature 
<222> CO s V C49S;) 

<223> Encodes the prQ«r«g1oEi shown in positions -16S to 6f SSQ ID 

<220> . ^ 
«;221> fR-iscfeature 
<zn> (4%) . . (10S$) 

-<223is- sncedes tlis sssatyre region shorn in positioiis of S£Q W 

NO: 43. 

<400^^ 1 

gctactggag cattacctcs gtcteetaca cctgaagcag atgeagtatc gatgcaagaa 60 

gcattacaac gtgatcttcia tcttscatca gctgaagctg aggaattact tgctgcstcaa 120 

gatacagcct ttgaagttga tgaagctgct gctssagtag ctggtgatgc atatggtggt IBO 

teagtattcg ataetgaatc actcQaactt actgtactag tgaccgatcic agcagctgtt .240 

gaagctgttg aagccacagg tgc&ggtaca gagctcgtat cttatgigtat: tgatggatta 300 

gatgagatcg tacaaQagct tatatgcagct gatgccgttc caggtgtagt tggatggtat 3S0 

ectgatgtag caggtgatac tgttgtctta gaagttcttg aaggctctgg agctgatgtt 420 

tctggacttt tagcagacgt aggagtcgist gcatccgcgg ttgv^sgtgac cacgtcagat 480 

cagcctgaac tctatgccgs tatcattgga ggcctagcgt acacaatggg tggtcgctgc 540 

agcgtaggat ttgcagccac aaatgcagct ggacaacctg gcttcgtgac agctggacat 600 

tgcggcc<|cg tcggt.acaca ggttactatc ggcaatggaa gaggtgtctt tgagcaaagc 660 

gtatttcccg ggaatgatgc tgccttcgtt agaggtacgt ccaactttac gcttsctaac 720 

ttagtatct^t gatacaacac tggcggatat gcaaetgtag Gaggteaeaa tcaagcacct 780 

attggGtcta gcgtctgccg ct<:agggtcg actacag^at ggcatt^tgg aaceattcaa 840 

gctagaggtc agagcgtgag ctatcctgaa ggtaccgtaa cgaacatgac tcgtacgact 5)00 

gtatgtgcag aaccaggtga ctctggaggt tcat&tatca gcggtacgca agcgcaaggc 960 

gttacetcag gtggatccgg taactgtagg acaggtggca caacgttcta ccaggaagtg 1020 

acaccgatgg tgaa*;tcttg gggagttaga ctccgtacat aa 1062 



loms . a04-w, ST25 ;txt 

<210> 2 
<21i> 1X43 
<212> OH.A 

<213> Artificial sequence 

<2M> A svfJthetic lOR gene (XOnsynt-15") encodlnq a S2A protease denoted 
"10k" fused by pcr in frame to the signal peptide enco«il«g 
sequence of a hsterologous protease » Savlniase* 



atgaagaaac cgttfgg gaa 


aattgtcgca 


agcaccgcae tactcatttc 


tgttgctttt 


€0 


agttcatcga tcgcatcggc 


tgc'tactsS'^ 


geatt^acctc agi;ct;<;ctsc 


acctgaagca 


m 


gatf cagtat cgatgc^sga 


agcattacaa 


cgtgatcttg atcttacatc 


agpgaagct 


ISO 


gaggaattac ttgetgcaca 


agairaeagcc 


tttgaftgttg atgaagctgc 


cgctgaagca 


240 


gct§gtgat§: catatQgt^Q 


ttcagtatt^ 


gatactgaat. cactega^ct 


tactgtacta 


300 


gtgaccgatg cagc^gctgt 


tgaagctgirt 


gaagccacag gtgcaggtac 


agagctcgta 


360 


tcttatQgta ttgatsgatt 


^atgagatc 


ytacaa^agc ttaatgitagc 


tgatgcegtt 


420 


ccaggtgtsg ttggatggt^ 


tcctgatgta 


SGaggtgata ctgttgtGtt 


agaagttctt 


Am 


gaaggctctg gagctgatgt 


ttctggactt 


ttagcagacg caggagtcga 


tgcatccgcg 


S40 


gtt@aagtga ccacgtcaga 


tcagcctg&a 


ctctatgccg atatGattgg 


aggectagcg 


600 


tacacaatggi gtggtcgcts 


cagcstagga 


tttgcagcca caaatgcagc 


tggacaacct 


660 


ggcttcgtga cagctggaca 


ttgcggccgc 


gtcggtacac aggttactat 


cggGaatgga 


720 


a#ggtgtct ttgagcaaag 


cgtattttcf; 


gggaatg^tg ctgccttclt 


tagaggtacg 


rm 


tccaacttta cgcttadta^ 


cttagtatct 


agatacaat^ ctggcggata 


tgcaactgta 


840 


gcaggtcaca atcaagcacc 


tattagctct 


agcgtctgcc gctcagggtc. 


gactacagga 


900 


tggcattgtg gaae£:attca 


agctagatj^t 


cagagcgtga gctatcctga 


aggtaccgta 


960 


acgaacatga ctcgtacgac 




gaaccaggtg actctg^agg 


ttcatatatc 


1020 


sgcggtacgc aagcgcaagg 


cgttacctca 


ggtggatccg gtaactgtag 


gacaggtggc 


lOSO 


acaacgttct accaggaagt 


gacaccgatg 


gtgaactctt ggggagttag 


actccgtaca 


1140 


xaa. 








1143 



<21Q> 3 

<211> 8 

<212> PRT 

<213> Artificial secjuence 
<22a> 

<223> c-terfsinal aarfiiq aeid tail expressed as fusion to protease of the 
invention. 

Gin ser h1s v'ai &in ser Al a Pro 
1 5 



<Z1Q> 4 

Page 2 



CJ.ST2S,t.xt 
<2i?> DMA 

<213> Artificial s&quetice 

<ZZQ> 

<223> Polvnu<;1eoti4e encoding a c-termi nal asstno acid tall expressed as 
fusion to pratease of th« liivertf jsn. 

<40D> 4 

caatcgcatg ttcaatccgc tcca 

<11Q> $ 

<21I> 4 

<2X2> PRT 

<213> Artificial sequence 

<220> 

<223> c~t:erm1nal amino acid tail expressed as fusion to protease of the 

Gln ser Ala pro 
1 

<210> $ 
<2X1> 12 
<;212> DMA 

<213> Artificial ssquenee 
<22G> 

<723> Pol^'mi€\mti4^ mcoMng a c-terminal assino acid: tail sxprssssd as 
fusion to proteass of nhe invention. 

<400> 6 

caatcggctc ct 12 

<m^ 7 
<212> PRT 

<213> Artificial sequence 
<220> 

<223> C-tersilnal asHno acid tail expressed as fasioti to protease Of the 
Invention. 

<4Q0> 7 

Gin Pro 
1 



<21D> 8 
<2il> 6 

<2:i2> om 

<2i3> Artificial seqyencs 

<22Q> 

pal ^'nucleotide encoding a C-terennal asnlno ac14 tall exprsisssd as 
fuslssn t© protease of tt1« IhyeMioh, 

<400> t 

caacea 6 



page 3 



imm . ,?04~M>, sras . txt 

<2ii> i 

<<^.l'i> Arfificia'i sequence 
<22a> 

<223> c- ui-nffjnal afirino actsi taH expressed as fusion to protease of the 
invent} ort. 

1 



<210> 10 
<2i2> ONA 

<213> Artificial sequence 

<223> Polynucleotide encodinq a c.-tenrdnal asHno acid tsil ^xpr^esssd 
fusion to protease of the invention, 

<4m> 10: 



<210> 13. 

<2i2> MA 

<21$> Artificial sequence 

<77M> 

<2i3> Pf ijser #252638 

<400> ti 

catgtgcatg tgggtaccgc asegttcgc^ gettgctgctg aagag 45 

<2io> la 

<212> DNA 

<213> Artificial sequence 
<32Q> 

<4QQ> 12 

eatgtgcatg tggtqgaccg att^tggagc ggattgaaca tgcg 44 



<210> 13 

<213> 44 

<212> DNA _ 

<213> Artificial seciusnce 
<220> 

^223> Primer :i:i?SS4i 

<400> 13 

gcgttgagac gcgcggccgc gafcgcegtt tggctfaatg atac 



<210> M 

■ai}> 43 

<;'.:l?> Oi^a 

<2i3> Artificial seQoerice 

Page 4 









<4QQ> 

gc.gtr.gi 


3.4 

igac agctcga§ca gggaaaaatg gaaccgcttt ttc 




'C/? > 

<,•' ! j:> 


Arf i f i ci al ssqtsirtce 




<223> 








jatc agaattcact ggccgtcgtt ttacaaccat tgcggaa? 


mt agtcataggc 








<210> 
<211> 


60 




<2X2> 
<233> 


Arti f 1 ci a1 sesi^Uenca 




<220> 
<223> 


Primer : #i?$S40 




<40O> W 

ggatec^sat ctggtacccg ggtcta^agt cgacgcggcg gttcgcgtcc Qfacageaca 


<210> 

•<2ii> 

<2X2> 
<213> 


i? 

37 

SNA 

Arti f 1 c1 al sejiluence 




<223> 


Primer #|?S154 




<4C>0> 1? 

gttgtaaaac gacggccagt yaattctgat caaatgg 




<210> 
<2ll> 
<212> 

<2;i3> 


M 
37 

vm 

Arti f i ci ai ss^juasice 




<220!> 
<^23> 


Primer #2.7&3;33 





3? 



<400> IS 

ccgcgtcgac actagacacg ggtacctgat ctagatc ^7 

<210> 19 
<2ii> 22 
<2i2> D:NA 

<213> Artificial seciuence 
<22<fe^ 

<223> pHmer #317 

<4W}> 19 

tggcgcaatc gg$^s:cstgg 12 

8 5 



<21Q> 20 

<211> 40 

<212> Dm 

<2X3> Artificial sequence 

<22l> priffter #B9 mtt 

■<400> 2S 

cstgtgeatg cggccgcatt ascgcgttgc cfcttctgcg 40 

<210> 21 

<2xl> ?443 

<2;i.?> DNA 

■s:213> Artlfieial sequence 

<220> 

<<^?.:4> s<-qu«nce. of plasmid pMsXSOS 

<400> 21 



tcgcgcgttt cggtgatgac 


y \j 4. ^ ]R t» » ^ 


■ .^yK^ M.^ 0*-^ 






vvf 


cagcttgtct ^taageggat 


W Vj^ \J-£t VJ ^vM 


?^ ?3 fi f*f" irs 

. V. K « V. \a Vst^ 






















aceatatgcg gtgtgaa&ta 






a fS J? ^ 55 V" 


<a v.vnyy v.y vv. 




attcgccatt caggctgcgc 


a art" St" QfS£j 




yy v'S^^sjsaw*-^ 








ggcga^aggg 






t. !t>y yy t> a 






ttteccagtc acgactttgt 




r r s iit' ?s 1^ 








ttgcaattga 


agaattatta 


atgttaaget 


i-aatt&aaefa 


taatatcttt 


gaattgtaac 


480 




agtaagSact 


acSaaaaaag 


Mtac^ttat 


at^^aatat 


gtttgaacct 


S40 


tcttcagatt acaaatatat 


tcggaeggaG 


tGtacctcaa 


atgcttatct 


aact^ata^aa 


600 


tgacatacaa 


gcacascctt 


gaaaattt?55 


aaatataact 


ao:aatga«c 


ttgttcatgt 


660 


gaattatcqc 


tcjtatttaat 




^^aati-ava^t^aa 


tatgccaata 


cattcjttaca 


720 


agtagaaatt 


aagacaccct 


tgatagcctt 


actataccta 


3catgatgta 


gtattaaatg 


780 


aatatgtaaa 


tatatttatg 


ataagaaycsii 


acT'ca'^i'ii'iiati 


aatcattaca 


tatttttcta 


840 


ttggaatgat 


taagatt cca 


atagaatagt 


gtataaatta 


tttatcttga 


aaggagggat 


900 


gcctaaaaac 


gaagascat t 


aaaascatat 


atttgcaccg 




ttatgsaaaa 


§60 


tcattttatc 


astttgaaaa 


ttatgtaTta 


tggagctctg 


aaaaaaagga 


gai^gataaag 


3.1520 


aatgaagsaa 


ccgttgggga 


aaattgtcgc 


aagcaccgc^ 


ctactcattt 




ioso 


tagttcatcg 


atcgeatcgg 




agcaaaagaa 


aaatatftaa 


ttggctttaa: 


1140 




gctgtcagtg 


agtttgtaga 


acaagtag^S 


gcaaatgacg 


aggtcgccat 


1200 


tctctctgag 


gaagaQgaag 


tcgaaattga 


attgettcat 


gaatttgaaa 


cgattcGtgt 


12i0 


tttatccgtt 


gagttaagcc 


cagaagatgt 


ggacgcgctt 


gaactcgatc 


cagegatttc 


1320 


ttatattigaa 




aagtaacgac 


aatggcgcaa 
P a ge 6 


tcggtaccat 


g§ggtata:te 
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<" J"f f ' ^1: 






gcagatctgg 


gaccaataat 


s t' fi ?5 t" *^ n 


"5 44 0 








aaggaacgaa tattggataa 


S3 fS"$" f J^J f5 ?J t ^J^J- 

j-vsj ^. y vj v> 


.1. J \j\J 


tv t-, I. St St El W V-GS. 




^ Sf f| t* » t' 


attgactttt 


aaaaaaggat 


li St t. V. t> St St ti. 


IS SO 


^ ^ ;^ ;^ n ^ r ;-J 






tcactttaga 


taaaaattta 




A; V> i V 


5i ?i 1' n ?i A ^* 






aattggaaga gaaaagagat 


vi f t* t* 54 a f if^Sf f 




*T a f t* t" J* 




t'Tf ^1 t* ^ i" 


accacagaaa 


ttgatattag 


t" fi f'i' t* ;^ t^ c 


1740 








taccctgcat ttattttc^t 


33 f?^* flo3 S3 S3 f5f 5 








u <u V 


SSttaj-aata ^cgacggaga 






i^'f jti* 






tttgatggtg 


tatctaaaae 


>* ft' i^f i^f rtrtf : 




is 4**^ A* >«. M 21 ^^iT^ 

<s. V. y <*■ V Si^*^ 






gagt:tt;'ta!fcg 


atttatactt 






rts^^i^is* S3 Si ^' jj t^ii 






cccMaacac 


ctatacctga 


S3 ■j' rti^'t' "J^ "t* "i* 
Awvi. .V V V 




w V WU.I. t. usv 


"t" si ■4'"4^ j^* ^'■■rt 




act^ggttta 


actt'casai^ati 






3i^^f'35 51 ff JS iT^ 




■ f S 'S*"f* "vl if* Si tf^ SS 


gg^saattca 


ttaataaagg 


f 33 f f V*5!f is f 34 




f 3f t* f <^ r^^f 

V, v v 


'fr' a i^i'i' K 54 




tctgtttgtg 


atggttatca 


f fif^ ?J r* fs ?i f f fs 






iT'f j* f f r'Jfti^WiS* 




aggcctaats 


actggctttt 












tttct^acga 


iacattaata 






J$ J!l 51 f f ■t^ : 






taccaataas- 


titaagggtaa 














aggascgatt 


1" J$ is 1* f fS i^i 


>4*50 




i^r^i" a <^ ^ ■J^i'i'/*' 


Si SS 3!(\SJ n t' ?^ j5 j5 


atcsgctgga 


ctaaaagggg 


••^ >'> J-"* -a >-i ii >-> Si 








;s » it ^'^ ss '3 f{ f 


ataggaggag 


aGC"£at^t;t;at! 






SI Si J5 -if" iT? f" 55 n 




55 <^ 55 i^SJSS S? fl 


cl'.'tQT.'tcfciat: 


tgct'feat.tac 








t'jT'^l'^^'^t t I" -i >' 


J" t"^"l^"^^■/'■'1" f' f"' A 

t, i^. t^. U V, 


tccggacttg 


aacctaaagc 






^.ir. ^ » -J -tv -f. J* 

!v ^^ l^t-i^-tl i, U U 


5„ c y C >v y ^l-tiC. 




tttstctcca 


agcccttgcc 


;5 1^ "J* i"'^ >'■> 








n riT r r n r ^ 


ct.aact.ggaa 


cactatcaat 












cttatcgtta 


tcgcattctt 


f y5 1* f t* f " iT" n n 


C s.i \.{ \.f 








atttc9tat8 


tattcgttca 


f i-i i^J 'J" 'j' t"^ f i fJ & 

<^ t. V J;; *A 




CI Si « W K* fit V" 


t* t" n f 


i^-ihit t*"f ^"5! f^f n 


ctattcagtg 


atttaatsct 


I- t.\..st \.. \.. t>yv.. t^ 






■Sx V, tv V- y ^>-J 


jj ^ 5if ** y *^ 


attst;a£££c 


ctattatcag 




3060 


«^<S.<3.<4. VOL 


rf 1" .'s f J r'tf^if^ 

^ SI t. S.. SX 5a y VVa 




acagagagaa 


aaateggtgc 


?J f f <*^f f 5i f f J4 




aaaaccggtt 


ttcaggggaa 


tctgatcaca 


tGtgctatgt: 


tcct^gacagc 


gat|gGggc0 


nm 






ggcccatgat 


gtccicapgpg 


tg^acttaac 


atggacaagc 


3240 




ccgcgattgt 


accggg&ett 


gtaagcttaa 


tcatcacgce 


gettgtgatt 




tacaaactgt 


atcegccgga 




acaccggatg 


cggcgaaaat 


cgcaacagaa 


3360 


aaactgaaag 


aaatgggacc 


gttcaaaaaa 


tcggagcttt 
Page 7 


ccatggttat 


cgtgtttctt 


3420 



WmS. > ST2S < txt 



t' t' Oi'lT O C T t'ti 


1^ n r f' 0 f ^ 5 n s 


t" t" t* firs n r 


agcttcaaca tcgacgctac 






ttc5at.cg9t.t~ 






caagf uctga 


cttgggatga 


1:" C A A O A A A 


3540 


Vw- sv ^ V* >4 


ctt'ocssatsr 


crtracf Tcsft 


tttgcggcgc 


ttgtcatgct 


r fi r r ?^ c T c 


v-< t,' W W 


ttoaa-toaat 


1" i-i 0 tjC H 1' n T 

V -t-v ^; jv;; i-^ tij ^-J V 


fjt ri-iVK!'i "f t' 


agtaatQcca tgaaatcatc 


IT tii' ^'t f Pi f ! n 




tiitictcttEsaa 






attgttgtcgt attattactc 








rcss c&n r r 






ca.t:ttttggc 












ctgagcctcg 


cgttcatmg 












gctceggtct tcttcfgate 


JS /J/^/^^ SS S3l 4^ 






S? Si flfj l^fjfj Tf* 


*^ vy 5?** 


^tcctgtega ttgttcatat 


J** 5i "i* ^*'fTf<*2a "t^VIFrt 














<i<S.<i<i^^^^<Jly fir 












at;Qigiattt;'td 














cg§ttggtg<: 












ataatggggc 


tgattgeggc 




















Vtf « W U 'M'M M 44 4>s 






agaaccattt 


cctatatgGc 


fi/"^'' n rs ^ 9 






54 fliSf i1 £i 






aagaggtcat 










i^rti" is "f" f" ^y* 




aaaaaggaga 


Vr» V l> vy s<i.«i^ V 




V- Iw K» b^vfvtu 








gagastatttt 






U V.A^ V.'Kt Ills} V V k 






cgaiig&^tga 




^ f4 /" a "S^ "fr" 






;^ ;^ ;^ f"? A t 

iAttM V> \^ <.* « 


f3i" t% fff"t^ 


atcggcatgcg 


tgtctgttga 






« vf ».v \jf v,s \, v,s y 


c,% y \i, ^..c>.X'' 


■ ■T'ii sj Sis ss^^f^i* "t^ 

'iVf K% m ^ ^-flxy .V,<^ 


ttgagaaatt 


tgagtgtgat 


i' f'^f' S" fi t"/} '^Sr^if' 
i-y V Uy vvVj V 








V Sjf V. V J V V VJ w 


gtgctggcga 


aaagcatcag 


^ A ^ /7*TS 't*' 'R i^'iT^ 




^ **■ ^ >j ^ ^ ^ 


\A- Vt^VOXv V* \< K< 


iCs A tr r 1^ 


ctat.?stcgt:g 


agaggaacgc 


SXi.t "v V V VV- 




oraattrosfi 

V\ v< v< V» xs >^ 


aasaoatta !' 

V-\xA.V^ v-^ V'' K.'S.^. <*. 


Vj v V 4.^ >s. >v"W'Vi' V. 




tcgtcaccat 








ySGStOCt' 0 

■t^'v. V^'fr.x -O^J.-^- x-. V.J x.** 


rt" 0 r r r 


cctgtgatcc 


atcttcctat 








^- 0 f > f ' r f ' f J t ■ 

V. '.s. v,^ \j \j ^. y. 


y Vs- U vt 4.vjXw* V. V. 


gaaaaaggag 


aaatgctgcc 


^ O « v.- V. « vjf « « 






Q-Vv-Vf Kl li. V. SS «M y. 


^ -X. ^ ^ Sv ^ ^ « 


aatacgaaag 


tgiitgaatcs 






C3 rot' Pi T 0 n iliT; 






aaaacagagc 


tgaagaagct 


?S i^^tA ("A ^A 




bv- u y ict y V; 








agactcatga 


j3*j*-t^+"+"/"S!: a ^ "^^ 

fiVU C V V v&titi t.- 




aagccttatg 


cgattttagg 






tgcaagcttg 


gcgtaatcat 


3220 


ggfccatagct 


gtttcctgtg 


tgaaattgtt 


aticcgctGac 


aattccacac 


aacatacgag 


saao 








cctaatgagt 


gagctaactc 


acattaattg 




egttgitgctc 






gaaacctgtc 


gtgccagctg 


cattaatgaa 


s#o 








gtattgggcg 
page S 


ctcttecgtct 


tcctcgctca 


S460 



ctgactcgct gcgctcggtc 
taatacggtt. atccacagaa 
agcaaaaggc caggaaccgt 
cccctgacga gcatcacaaa 
tataaagata ccagfcgttt; 
tgccgcttac cggatacdtg 
Qctcacgetg taggtatctc 
acgaaccccc cgtteagccc 
acecggtaag acacgact:ta 
cgaggtatQt agigcggtgct 
ga^g^acagt atttggtatc 
gtagctcttg atccggcaaa 
si^ca^at'tac ^c^ci^gs^aa 
ct^aegctca gtg^aacgaa 
ggatcttcae ctagatcCtt 
atgagtaaac ttgstctgac 
tctgtctatt tcgttcatcc 
gggagg^ctt accatctg^ic 
caccggattt atc,a@c;aata 
caaetttatc cgcctcotc 
cgccagttaa tagtttgcgc 
cgtcatttwg tatggcttca 
cccccatgtt gtgcaaaasa 
agttggccgc sgtgttatca 
t g c c at ccg t sag at g cf 1 1 
agtgtatgcg gcgaccgagt 
atagcagaac tttaaaagtg 
ggatcttacc gctgtt§aga 
cagcatcttt tactttcacc 
eaaaaaaggg aataaggpcg 
att att gaag cat-tt at eag 
agaaaaatas acaaataggg 
aagaaaccat tattatcatg 











SS20 


T c <^ nnr^ A t' a 


V4 N \„- « V. J S* '-t " 




fi r a a a a Of! rr 


5580 


aaasaocKxg 




gtttttccat 


aQgctccgcc 


5640 


s t r o r of 


>^4A^-y Sly 


a^fSf iroa 3 a r 

VJ vj >.t iA. V 




S700 






a cacfcct cct 


Q't'tccgaccc 


S760 








ctt;l"ct;eal:a 


5820 






Gt Gcaactct £f 






















rfS i*"?" A ifj A 






■f jfi S5 K fS fS 


W W W b v4 WXv CvKtXv. 






















£i T f £i r A r 












6240 


3 54 TfSiii^£3^T 








6500 




0.0, V^j^C^i V^ ^ 




?i ^ ^ f5 't' a a 1* 












^420 




S^fit V. V. X:.V.V.4^')L4.'' 




3 'f' !fit n 5i l^si 


64SO 






<^ 44 V4 r4 xA Ihr ^ 4^ Vt- . 




6:540 


v.- ^ M U Vi.V.'CaC^ 






at n t" O £J t" i!' t:n 


6600 




. j4 ij, .J. A. .« -V 




f 1 1 ' 5S Si ^4 ''"^ t^'T 






r r 5J T T fj^'t* J? C 






6720 


V \. ^-V »»J V. '^.VJ 




j.c t.. AvA!:^!.) Y^.t^ti- 




6780 




V» V* V- V> V-\-lVj Vw w 






6840 


r t" r A f t 








6900 




dtdad'tftcS't' 




j. jj.,, J., 


5960 


t n r 1* r 1* Ta c r 


'■- wy*->* <-*->■<■■«<■ 






7020 


rt est cstTCJ 


csaaaacQttc 




asactctcaa 


70S0 


r r J5 ^5 1 1 c o i3 










agcgtttctg 


ggtgagcaaa 


aacaggaagg 


caaaatgccg 


7200 


acacggaaat 


gttgaatact 




cttttteaat 


7260 


ggttsttgtc 


tcatgagegg 


ataGatiattt 


gaatgtattt 


7S20 


gttccQcgca 


catttccccg 


aaaagtgcca 


cctgacgtct 


7380 


acaftaaect 


ataaaaatag 


gcgtateacg 


aggccctttc 


7440 
7443 



1043 5, 204 - W . ST25 .txt 

<210> 22 
<211> 5718 
<212> ORA 

<213> Artificial seqyefjce 
<?.20> 

<213> Ssqumca of ^BlSiO gertonvic integration region 
<400> 22 

gagc^ccgtt tggctgaatg atacaacast etcacttcct tsetgcgtct ggttgcaaaa. 60 

acgaagaagc aaggattccc ctcgcttctc attt^tccta tttattatac acttttttaa 120 

gcacatettt ggcf ettptt tcactagact tgatgectct gaatcttgtc caagtgtcaG; ISO 

9S*cc9eatc atagacttgt ccatttttca ccgctttgag atttttccag agcgggttcg 240 

ttttccactG atctacaat9 9ttttge<:tt cgttSQCtga at^tcagg^^t 300 

c:g®ttttgct caattg<;tca aggctgacct cttgataggc gttatct§ae; t|:ca^^ 360 

gtgtaaagcc tagcatttta akgatttcte cgtcata^ga tgatgatgta tgaagctgga 420 

aggaatccfc tettgcMcg tcgagaacp tgttgcggtt ttcatctttc ggaagti-cgcf 480 

Cttttagatc ^ttgatgafct tttttgtpct cggcaagctt ttcttttect tcatcttctt S40 

tatttaatgc tttagcaatg gtcgtaaagc tgtegatcgt ttcgtcatat gtcgcttcac 606 

ggctttttaa ttcaategtc ggggcgattt ttttcagetg tttataaatg tttttatggc 660 

gctcagcgtc agcsatgatt aaatcaggct tcaaggsact gatgacctca agattgggtt. 720 

C£?ctgcgtgt gectacagat gtgitaateaa tggagctgcc gacaagcttt itaatcatait 7W 

ctttttt^tt gteatct^cg atgeccaccg gcgtaatgcc gag^ttgtga acggcatcca 840 

agaatgaaac c-tcaagcacra accacccgct taggtgtgcc gcttactgtc gtttttcctt 9G0 

cttcgtcatg gatcactctg gaatctttag actc^ctttt gccgcttccg ttgttattcit .960 

ggct.t.gatga acagccggat acaatgsggc aggcgagcaa taaaacactc atgatggcaa 1020 

tcaacttgtt agaataggtg cgcatgtcst tcttcctttt ttcagattta gtaatgagaa lOSD 

tcattatcac atgtaacact ataatagcat ggcttatcat gtcaatattt ttttagtaaa 1140 

gaaagctgcg tttttactgc tttxtcatgs aaocatcatc agacacaaat. aagtggtatg 1200 

cagcgttacc gtgtcttcga gacaaaaacg catgggcgtt ggctttagag- gtttcgaaca 1260 

tatcagcagt gacataagga aggagagtgc tgagataacc ggacastttc ttttctattt 1320 

catx-tfttag tgcaaattca atgtefeega tatteatgat aatcgagaaa acaaagtcga i3S0 

tattgatatg aaaatgttcc tcggcaaaaa ccgcaagcte gtgaattcct ggtgaacatc 1440 

cggeacgctt atggaaaatc tgtttgacta sateactcafi aatceaagca ttgtattgct ISOO 

gttctggtga aaagtattgc attagacata cctcctgctG gtacggataa aggcagcgtt 1360 

tcatgftegt gtgctccgts cagcgf cttc tccttaattt tgattttitct gaaaataggt 1620 

cccgttccta tcactttac?; atggacggaa aacaaatagc tactaccatt cctcctgttt 1680 

ttctcttcaa tgttctggaa tctgtttcag gtacagacga tcgggtatga aagaaatata 1740 

Fags 10 



10493 , 2(M-W,st25 , txt 



tui'iM ti ra t" ? 




t" ma r 3 1' oa a 


accaittgta. aaagagtata caaatgacga 


1 800 


■x^ V-. N*. V" V V- t-V x- 




:s i"i OS 3 1 f r ;? 


gaasatgggt gttgegaaag 


aggatgtata 




ratr'i'tacjrt" 


>,--^j-v,--.\J.sy-. vv V- x.^ 


3 raoaa ■•"ooa 


acQcctcsgct gacaacacga 


acgccaacac 






a a a o s aiiCcj o 


f 3* !• t- r 3 a f3r-3 


Ggcggtggga aatatcttca ataaaaaagg 


13S0 


***if ^ y " 5? ^ " >^ 






cggtttttct gaagatga&g 












tctctttgtg acagataacg 


aaaaa§tgaa 






A A ;5/ir3 ^ ttti 




mQQCcmxu tcggcctttt ggtttttttg 






»5# V** ^ ^ ^■ 




CQcaatagga tsgcggaacsi: ttttcggttc 








^ liVo. V.«,X» fciX.^.. 


tgtgataaat tggaataaaa i:ct(:acaaaa 




■f* 5S ;5 5( J!J j5t fiErt 






aagtgatgtt agctacggct 


ttgtttttag 










attta^acca ctaoiacgttd 


ddatCcaatg 










caggc^gatc aaaagfcatcc 


tcctcaaatg 




^tsT A A flt^ 






tctcggcatt adagaaggaa 

>w VP. , fir ^.Vf. WW 


acgaacacaa 




t^<3'rt^?t^ Pi 7i St' 






ttgacatgaa cgtggatgac 


aatctgaagc 




r f5 r f 
V. vjv V vJJ*JV.wv■ 






agtatgattt ggacaaiatat 


ttgaaagcct 




-tk-V^^ V.\yV. Vh^*^ 






cgtcgggaac acaagaargaa 


gcgagagcaG 






SI a ss >^ a ft ft ft ft 




tSgtgg^tat c<:<;tgcaaac 


a^sgacgatcg 




tf'^rt fsi" S¥ 






9sgf aaactt ccaaatcaag 


adtgataafcg 




J* i3t ^ ^ f ^ i'* 






cctstsacta ttttccgcaa 


tggttgtaaa 




W w^'tt S(v:y y V ^ cs. 




jx *. s> a 'a f - ftri ^ 


tcagtgagag cgaagcgaac 


acttgiatttt 






?S 1* rfT f^T ^5 1" P 

S V V" V- V- «s' *ilCi' 










r ?s <" t* ^ 


n ?' "j"*^" t* t' n 


51 ■$* s? f I'i* r' 


taagttgagc atattafjatig 


aggaaaatct 




"Sh f3 f J 5i f3 ?5 S» "TA 


K >v VvM >.v (S, V,, 




gatcaggtac c§eaac<?ttc 


gcagatgctg 






V" %<■ V-.- v< X* -v> 


t" a ? ct c^' 


aa(5§ctatca attggtaact 


gtatctx-agc 




'tt' ft S3 da apt 


fiaaciassr^o 

ya<;!.4.j«.avj •-^.y 


3 ?! ;~! n .'T r T ;iTt" 

\j >.f J.J K. s.. i>. v. V. 


qaataaatga otaqaaagcg 


ccatatcggc 


>,< £. -'-t \l 


V V V t.. 1. 1.. V U U 


f 5f v^f ^ n '-^ 




atggtactts ttaaaaattc 


ggaatattta 




?i r ?j ?i T ^ "i^^ r 


J.",! lAJ i^iX 


A i' f \ ^ ;s i"i f 1 


ogogcctgct gtccagactg 


txxgctgtgt 




?5 ?J ?J ?} f } Mfi^ 






attttactcia tatgtataat 


ataatttgta 










gatigaascct tagataaaag 


tgcttttttt 








aatgttaagc 


ttaatt;aaa§ ataatatctt 


tgaattgtaa 


3S40 


cgcGcctcaa 


aagtaagaac 




gaatacgttra tatagaaata 


tgtttgaaec 


3600 




tacaaatata 


ttcggacgsa 


ctctacctca aatgcttatc 


taactataga 


ssso 


atgacataca 






aaaatafeaac taecaatgaa 


Gttgttcatg 


3720 


tgaattatcg 


ctpt^tttaa 


ttttctcaat 


tcaatatats atatgceaat 


acattgttac; 


3?B0 
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^ n i" :5 ^> 1^ 


?; ?! ^1 St r ;^ i" <" i" 




■^^ 3 ^ ai "J^ 3 f" 




"JS iT:^ ;3 "1"' "f " ^3 ;3 ^5 




n ;^ V ^ 's" fJ ^'^J^ A 










>.i. <.>.4 i. I. 1. JL. 


vJ v> V V 




t* t* .-^ n c5 1 tec 


?i ?i t' s ^ t* o 


t O l" ci t fl Si'f 

Sw-\4 5.-. >.{ k\ !..< lyX Vi. V» 


S f t" ^ 1" t" t'O 


s i5 s o Oc J G tS Q fj 

fti v4 St \^ V.^ V< ^ >^ 


3950 


1" f l r C ' t ?i ?^ ?i 

I.. t\ « 4^*, 


r €i ;^ i"^ r ;^ t 


■t- :;t ?t ;rt s s 




-1- -J-s* a T" « 




V *U ^' 


;^ fr ^ t" t' 3 i" 




jS "t* "f*^ SI "i* jfi*f^ si ^ 

K( 4tf 'CvKl 








40SQ 


rsss ^SS^ 5ii% Ai^^^ 




if^i^ A rst* 

^MUVj tij-n tvU 


rtrt 3 jft^s s* ?j 




•^f"^ ss fits 5i ^ 




^ V>M V ^ v« 








V V<4 y K. V .« *J V 


fj£'$' f >f3 f J SS 






rs *■ *^ 'J*--*^ *^ 


">^S!^ v^wb^ v.«v.«v. 






vjU<et' V tsr V 








■ 'S" ■'^ ■ 4* /** if^ ^ "t* +• 




V V V S- V 






I^Va^iCK. V i- Jv ~V 






■5ii-f^^f'rtrt3 IS 1^ IS JT' 












A "t^f^rs *^ t*"t* 








444fJ 
















5i a f iT 't' 't^fS 




.■4, ^Vrf«A,\,> ^5a^ 




"t* o!% 3 js Hi* "f^ 












m V V V 


t^^'S^ r si'i^*^ 
V v^^'V. Vs tx y *« *h\« 






>»v«A *^ i>.v5ij.y.<*.v. 








f^fs rs /I ^' ^ 


/*^j*'5*^i3 'i*??^ Si'53 








fta 3 V ^ t" 














jw jw.^ij. Ms'^s J* 










"i* t^i Jiff's 
^ Vf^^ V V y *y 






■ .£s rt "t* is ^ Ji* 3 








a ^ i'^ ^^ f^ 't' S? '+* f 










3 ia fSti^jH St SI St SS- 


4«^?fj 




*J* i*i f ^ i'J f*^' A 't^' 

t. y y y *^ w s«\f w u 


■ f^si^J S3' si' ^' si 






^ Sv Svt^ V L- 




/*i "f - ^ "j^ t- .-"^ 


.<-. .*> « <^ ^ 




^"f'T'a J5 f S5 f /J 








rA \.>-vJ-j t,. t.. t. 5-;t^ 


J'' ^ f"" ">' C t" 












^ y y y ^ 


n ■f' ^ t'" n "J" 


V.-wL\.^ V- v,^«;<-; \^ V, 






i ? j'- "t" i" 1^ f } 




f5 T €i 'A 1 1 3 O 








3 ^ * St r r r n r 




5220 


v.- v. V V 4-} C3. tw- 


f" n f J r ?J t" r" 


V t> t> V* >.< i> 1. 


y \.> vtj Vvi v.- V 


•J t" ■f'^ a f- f ' 'j* f * 


V- V-.<A V V.V- V v.- W 


■w* ^> W V/ 




Ci^ c c c ^ c ;-i T 




i - V t c ?i U C '-^ T 


1 1 1" ^' fjf^jr i ^j '" 








i" rs i" f" r" f" 1^ 


if ?f i" rj r" t" 


'.5 n 1" n r f J 

«j; vjx-. V. t. v> \j V. v. 


vV.ti tv w';^ v5 <5 










• j' ("i ft ;-i f! f ' f f a 








S'4f<0 


caaggcaaat 


ggtggtccat 


csgatttatc 


ctgtcgattg 


ttcatatcat 


cgtatggctt 


55 213 




gattatggtg 




ggMtatggt 






SS80 


ggtctgcctt 


tttttatttt 




tasgaaaatg 


gattttgaaa 


aatgagaaaa 


S640 


ttcectgtga 








<^t:ggtgctgt 




5?00 


cggttccatt 


tttccctg 










S7a.8 
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<2W> 23 

<2li> 2? 

<212> mA 

<21?i> Artificial seqygnce 
<Z2Q> 

<2Z3> Primer 16QS 

<40Q> 23 

gacggccagt gasttc§ata aaagtgt 27 

<210> 24 

<ZU> 41 

<2i2> BHA 

<220> 

<223> PriiMf 1606 



stfi sc.„fsature 
<22Z> (13).. (B) 
<223> n is 8, c, g, or t 

<220> 

<221> ml sc.„.f sature 
<223> (3 is a, Cy 8, »r t 
■<400> 24 

ccagatctct. atnktnktgt acgga^teta actccccaag ag 42 

<2;m 25 
<2;U> 1112 
<21,2> DMA 

<213> Hocardio^sis dassony-niei DS!*? 43235 
<4Q0> n 

gc'ttttagtt categatcgc srccggctgct ccggcccccg tctcccagac cccegtcgct SO 

gacgacagct? ccgccagcat gaccciaggcg ctcaagcgcg acctcgacct cacctcggcc 12S 

gaggccgagg agcttctctc ggcgcaggaa cjccciccatcg agaccfjacgc cgaggccacc IZO 

gaggccgcgg gegaggccta cgcicggctca ctgttcgaca ccgaga.ccct cgaactcacc. 240 

gtgctggtca ccgacgcctc cgccgtcgag gcggtcgagg ccaccggagc ccaggccscc 300 

gtcgtctccc acggcaccga gggcctgacc gaggtcgtgg sggacctcaa cggcgccgag 360 

gttcccgaga gcgtcctcgg ctggtacccg gacgtggaga gcgacaccgt cgtggtcgag 420 

gtQCtggagg gctcctiacgc egacgtcgcc gcectgctcp ccgmcgccgg tgtggactec 480 

tcctcggtce gggtggagga fgccgaggag §cccegcagg tctacge catcatcggc S40 

ggcctggcct sctacatggg c^gccgctgc tcc^teggct tcgccgcgac caaca^cgcc 600 

ggtcagsiccg itttcgtxac cgccg^cc^c tgcggcaccg tt^gmmgg cgtgaceatc 660 

ggcaacggca cGggcacctt ccagaactcg gtcttceccg gcaacgacgc cgccttcgtc ?2U 

cgcggcacct cmstcttmc cctgaceaaG ctggtctcgc gctacaactc cggcggctac 

cagtcggtga licggtacca^ ccaggccccg gccggctcgg ccgt^tgccg ctccggctGC S40 
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accaccggct giicactgcgg caccatccag gcccgcaacc agaccgtgcg ctacccgcag 900 

ggcaccgtct actcgctcac ccgcaccaac gtgtgcgceg aocccggcgsi ctccggcggt 960 

tcgttcatct ccggctciica ggcccagggc gtcacctccg gcggctccgg caactgctcc 1020 

gtcggcggca cgacctacta ccaggaptc acGccgatga teaactcctg gggtgtcagg 1080 

atccgsJiCvt aatcgcatgt tcastccgct cc 1112 

<2M> M 



<220> 

<223> Primer 1423 

<400.>- 2§: 

gcttttagti: catc^atcgc atcggctgct ccggeccccf ttci:tmu ^§ 

<210> 27 

<211> 4S 

.<2X2> DM 

<21.3> Artificial s^quiehce 

<220> 

<223> primer M7S 



ggagcggatt gaacstgcga ttaggtccgg atcctgacsc: cccag 4S 

^2:i0> 2S 
<211> 354 

<2i2> mr 

<2ll> Nocar'di sjjsl s classi>nv1 1 1 ©1 
<220> 

<221> PiROPE?> 

<222> m-'itm 

<220> 

<222> (167),.<354> 

<i^1a Pro Ala Pro Val Pro Gin Thr Pro Val Ala Asp Msp sef Ala 
~16S -160 -155 

Ala ser r4$t "rt^r slu Ala ten lys Arg Asp Lea asp l«u Thr ssr 
-ISO -145 -140; 

Ala Glu Ala Glti Gly Leu Leu Ser Ala <3tn: Slu Ala Ala il^Mdlu 

-3.39 ~12S 

thr Asp Ala slu Ala Thf <Slu Ala Ala Gly <STu Ala tyr islv Gly 
"120 -115 -lie 
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Ser Ley phe Asp Tbr Glu Thr €tu Leu Thr val Lsu Va1; Thr As}> 
~10S -100 -m 

Ala ser Ala Val Ala Val Slu Ala thr «1y Ala Ala T^sr Vsl 
-90 -85 -SO ~?S 

Val ssf His <a1y Tisr <iTu Sly Leu: Thr Slu Val Val filu Asp teu Asn 
-70 -65 -60 

<3ly Ala ssTu Val pm iSlu Ssr v^l Leu sly trp TVr Pr*o Asp val slis 

Ser Asp Thr Val val val clu val .Leu slu (Sly: ser Asp Ala Asp Val 
-40 -m 

Ala Ala Leu teu Ala as|j Ala sly val Asp s«r ser ser val Arg val 
-25 ~2S -15 

Glu Glu Ala Glti Gls4 Ala pro <sln Val tyr Ala a$|) lie xle Qly aly 

Leu Ala Tyr Tyr Met &y .sly Arg Cys. Ser val 61 y Phe Ala Ala Thr 
10 is 20 

Asn Ser Ala Gly Sin pro Gly Ph© Val Thr Ala ^ly His Cys <sly Thr 
n 30 3S 

val Gly Thr iSly val thr lie Gly Asn Gly Thr 6ly Ttur Phs <sln Asn 
40 4S ' SO 

sar val phe Pro Sly Asn Asp Ala Ala Phe Val Arg <3ly Thr Ser Asn 

go 65 70 

Phk Thr Leu Ttsr Ash Leu Val Ser Arg tyr Ash Ser- sly <5ly tyr 61 n 
75 BO SS 

Ser Val tlir gIv Thr Ssr Gin Ala Pro Ala (5ly ser Ala val cys Arg 
90' 95 100 

ser Gly ser Thr Thr <siy Trp His cys Gly Thr lie aln Ala Afg Asn 

Sin Thr val Arg lyr Pro Gin Gly Thr Val tyr ser Leu Thr Arg thr 
im MS ' 130 

Asn val cys Ala Glu Pro Sly Asp Ser Gly sly ser pM ll& Ser «ly 
13 S MO 14 S ISO 

Ser Gin Ala Gin Gly val thr ^r €ly sly ser Gly As« cys Ser Val 

135 160 im 



Sly Sly Thr rhr Tyr Tyr Gin Glu VaT Thr Pro !*5et lie Asn Ser Tm 
170 i?5 ISO 

Gly \^sl Arp xle Arg thr 

<211> 4SS 
<212> 

<m> mmrmopsis ^msmym&i Bsn 43235 
<400> 29 

§ctccggccc ccgtccccca gacccccgtc gccgacgaca gcgccgccaf eatgace§ag 60 

gcgctcaagc gcgacctcga cctcacctcg gccgaggcc^ -aggagcttct ctcggcgcag 120 

gaagccgcca tegagaccga cgccgaggcc accgaggccg cgggcgaggc ctacggcggc ISO 

tcactgttcg acaccgagac cctcgaactc accgtgctgg tcaccgacgc ctccgccgtc 240 

gaggcggteg aggccaccgg agcccaggcc aecgtcgtct cceacggcac cgagggcctg 300 

accgaggtcg tggaggacct caacggcgcc gaggttcccg agagcgtc.ct cggctggtac 360 

eeggacgtgg agagcgacac cgtcgtggtc gaggtgctgg agggctccga cgccgacgtc 420 

Hecgccctgc tcgccgacgc cggtgtggac. tct:tcctc0g tccgggt^ga o^acigccgag 480 

gaggccecgc aggtctae 49S 

<2i0> 30 

<211> 166 
<21?> ps:r 

<213> f^ocardlopsls iigsspfivillei OSM 43235 

Ala Pro Ala Prp Vai Pro Gin thr Pro Val Ala Asp Asp Ssr Ala Ala 
1 5 10 15 

ser lytet thr Glu Ala te« Lys Arg A;sp teu Asp teri 11?r ser Ala 
20 25 20 

Ala Glu Glu Ley Leu ser Ala Gin Slu Ala AM lie slu thr Asp Ala 
3S m 45 

Glu Ala thr Glu Als Ata. Gly slu Ala Tyr sly Gly ser tm Fhe Asp 
SO 55 $Q 

Thr Glu Thr Leu slu Leu thr Leu Va^l thr Asp Ala. Ser Ala val 
gS 70 75 80 

«1« Ala Val elo Ala Thr sly Ala Gin Ala Thr Val VaT sef Hts Gly 
85 90 m 



Thr <3lu Sly Ley Thr <j1u val val Glw Asp km Asn <3ly Ala €lu val 
3.00 105 110 



Fr« «lu ser vai Lm «ly Trp ryr Fm Asp val Glu s«r Asp Thr val 
lis 120 125 

Val val Glu Val teu Glu g1v Ser Asp> Ala Asfs val Ala Ala L&u Lew 
130 135 140 

Ala Asp Ala Sly val Asp sar sel- Ser val Af-g val sIsj S1« Ala elu 
145 ■ 150 iSS 160 

6lu Ala Pm Gl(i vsl Tyr 
<212> DNA 

<213> Artificial sequei^ce 
<220> 

<22S> Ths oSijA sequence csdim? for th<a pm-rsgion of SEQ lo HO: 20 fused 
in frame to- A191SL2 prot&ase tail-variant encoctln^ aene; whole 
cons-truct;: 10RCpn)Al91SL2>. 

<490> 31 

atgaagaaac cgttggggaa aattgtcgca agcaccgcac tactcatttc tgttgctttt 60 

agttcatcga tcgcatcggc tgctccggcc ccegtccccc agacccecgt c0ccga:cfa:c i2S 

agcgccgcca gcatgac«;§a ggcgctcaag cgc^scctcg acctcacctc ggecgaggcc ISO 

gaggagcrttc tctcgg<pgca gctaagccgCG atcgagaccg acgccgaggc c^ccgag^cc 240 

gcgggcgagg cetacggcgg etdaetgttc gacafccgaga ctctcigaact caccgtgctg 300 

gtcacC^sicg cctc<;§ccgt cga^gcc^gte gaggeeaceg giagctcaggc, taccgtcgtc 36D 

tecctSGggca ccgagggect gaccgaggtc gtggaggact tcaacggcgc cgaggttccc 420 

gagagcgtcc tcggctggta cccsgacgtc? gagagcgaca ccgtcgtggt cgaggtgctg 480 

gaSSgctccg aegccgacgt cgccgccctg ctcgccg-sctj ccggtgtgga ctccrxc-tcg S40 

gtcegggtgg aggaggccga gg^SS-s^^^i^cg cisggtctatg ccgata-tcat tggaggccta 600 

gcgtacacaa tgggtggtcg cLgcagcgta ggatttgcag ccacaaatgc. aeictggacaa 660 

cctggcttcg tgacagctgcf acattgcggc cgcgtcggta cscaggttac tatcggcaat 720 

SOaagaggtg -fxtttgagca aagcgtat.tr. cccgggaatg atgctgcctt cgttagaggt 7S0 

acgtccaact ttacgcttac taacttagta tctagataca acactggcgg atatgcaact 840 

gtagcaggtc acaatcaagc acctattgge tctagcgtet gccgctcaga Stcgactaca 900 

ggatggcatt gtggaaccat tcaagctaga ggtcagagc^ tgagctatcc tgaaggtacc 960 

gtaacgaaGa t^actcgtac gactgtat;gt gcagaaccag gtgactctgg aggttcatat 1020 

atcagcggta cgcaagcgca aggegtitacc tcaggtggat ccggtaactg taggacaggt lOSO 

ggcacaacgt tctaccagga agtgacaccg atggtgaact cttggggagt tagactcegt 1140 
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<21Q> 32 

<211> 106S 

<212> BNA 

<213> s^iocard-iopsis Alba oss.'! 1564? 



<400> 32 

?? ^^i^ ^23 SS V V 










60 












110 




r if ■ t' f* ^5 A t" 


s<t^ V 






ISO 




M K. v.. !sf <>. 








74.(i 


fi -t" i"^ n ci^i ''"J n 

y lA.y 


■ih fin 

u v-T. vj y V.-. ^ y V. 


\.^\^\^ v. t,.- «y 




iTfiri s; i^ii^ifi 

V v.Ki I, y y iJt VJi~J;!~J 'a,,. 


^(\{\ 
vv 


f "i' A ^ f u'; ^^ A 


L. (a Vs. V ^ y^"- 






r* 1" »" i" f' 1 i' ''1 f\ /' ' f 3 f « 






"f" f ^ i^ f' ^ C/^ ^ 
t-v Iv Ci 4.,- V. v.- 


^'Ai'^ i"\ n t" 




f- j-^ 'J r< ;-i n ■ft^iNi^' rt f' r-i^ "a 

i. v.. y i:t\J Sj '■•V ^-^jy y t-x.^ttv. 




fj"*^" r: n r t r r c 


'i'^ti i'^V Cft C 




J 1 1. V- (-1 v y w w 










f "if r ;^ r tT" 


^ 1' r fjfi f -cji'^it" 










V- v- >-t. V- V. jajx v.. 


J, I, V- V-JK^JV V- 








V; <i W w vf w vf Vj 


i" f" r r {' 11 (t f 


n t" ;^ 1 1" n f ^\ 






cgt.tccgtct 


tccccggcaa 


cgact ccgcc 


ttcgtccgcg 


gcacctcgaa cttcaccctg 


720 






caacaccggt 




ccgtctcc99 etcctcgcag 


780 


gcggcgatcg 


gctcgcagat 


ctgccgttcc 




CGggetiigca ctgcggcacc 


840 


gtccaggccc 










900 






cggtgactcc 




tcatctccgg ciagccaggtc 


960 








tgctccttcg 


gtggcaccae ctaictacGag 


1020 








ctgaccctgc 


gcacctgs 


loss 



<2i:i> 3S5 
<'il2> par 

<213> Noeardlopsis Alba DSM 15647 



<220> 

<221> PROPEP 
<222> Ci)./a67) 

<220> 

<222> (168).; (3555 
<4D0> 33 

Ala Thr «1v pro L6U Pro Gin Ser l>r» Thr Asp sM Ala gTu 

-165 ^im -iss 



Ala Ttsr Thr ^let val 61 u Ala Lets Qln Am ass lsu Gty Lsu ser* 
-ISO -145 



f>m Ser Gin Ala As|> Glu i.m Lm Glu Ala sin Ala Slu Ser Phe 

Paga IS 



-135 



104$5. 204~VilO, ST2S . txt 

-mo 



Glu lie. Asp Glu Aia Ala Thr Ala Ala Ala Ala Asp ser Tyr 61 y 

"120 -lis -aid ■ 



Gly ser lie Phe Asp Thr mp ser Leu TStr Leu Tlir Vsl ieu val Thr 



Ala ser Ala val alu Ala \'al elu Ala Ala sly Ala <s1y Ala 



Val Val Ser HTS *Sly Met JSlu sTv Leu Slu Slu lie Val Ala Asp Leu 
-7S -70 -65 -60 



Asii Ala Ala Asp Ala Gin Pre- Gly V&i Val Gly Trp tyr Pro Asp lie 

~:SS -^50 -4S 



His S^r Asp Thr val Val Leu Slu Val Leu Slu Gly Ser Gly Ala Asp 
-40 -33 -30 



Val Asp ser Leu Leu Ala Asp Ala Sly val Asp Thr Ala Asp Val Lys 
-2S -20 -IS 



Val Qlu Ser Thr thr 6lu <sl» pro 6lu Leu ryr Ala Asp Xle lie Gly 
-to ~5 "1 i 5 



Gly Ley Ala Tyr tHr mt Gly Gly Arsi cys Sef* Val 6iy Piie Ala Ala 
10 " M 20 



Thr Ain Ala Ser Gly ^Ih Pro Sly Phte Val tlir* Ala Gly His cys Sly 
2$ 30 35 



Thr Val sly Thr pro Val Ser xle Sly Asn sly sin sly val Fhe Glu 
m 45 50 



Arg Ser val Fhe pro sly Asn Asp ser Ala Phe val Arg Gly Thr ssi^ 
SS 60 65 



Asn Phe Thr Leu Thr as n Leu val Ser Arg Tyr ash Thr c-ly Gly Tyr 
?Q 75 e(j 85 



Ala Thr Val ser Sly Ser ser sin Ala Ala, xle Gly ser gIr xle cm 
M m 100 



Arg ser Sly ser Thr Thr Gly Trp His cys sly Thr Val Glsi Ala Arg 
IDS 110 IIS 



Sly Gin Thr Val Ser ryr pre Gl« Gly Thr val Gin Asn im Thr Ar« 
120 12 S BO 



Thr As» Val Cys Ala Slu pro Sly Asp Ser Sly Gly Ser Phe lie Ser 



85 
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10495, 204-»,ST25 .txt 
135 14Q 14S 

<slv ser Sin Ala sin Gly -val Thr ser <a1y -sly ser sly Asn Cvs S&r 
tm 155 ISO ' ■ 165 

Fhe Gly sly Tfer Thr Tyr Tyr «1n 6lu ¥al Asn Pm m% im ser S«r 
170 X7S 1§0 

Trp Sly L6U Thr Ley ArgThf 

<210> 34 

<21.1> 43 

<2X2> DMA 

<2,13> Artrficial sequence 
<220> 

<22S> Primer 1,421 

<400> 34 

gttcatcQat cgcatcggct gcgaccggcc oxtccccca gtc 43 

<213> Artificial sequsnce 
<IM> 

<Z2%> Primr 1604 

gcggatccta tcaggitgcgc agggtcagac c 31 




16. 



Nocardlppsis isfasina D5S^ 15648 

<400> 36 

gccaccggac cgctccccca gtcacccacc ccggagcjccg acgccgtctc catgcaggag 60 

gcgctccagc gcgacctcgg cctcjaccccg cttgagoccg atgaactgct ggccgcccag 120 

gacaccgcct tcgaggtcga cgaggccgcg gccgcggccg ccggggacgc ctacggcggc 180 

tccgtcttcg acaccgagac cctggaactg accgtcctgg tcaccgacgc cgcctcggtc 240 

gaggctgtgg aggccaccgg cgcgggtacc gaactcgtct cctacggcat cgagggcctc .300 

gacgagatca tccaggatct caacgccgcc gacgccgccc ccggcgtggt cggctggtac 360 

ccggacgtgg cg^ftjgacac cgtcgtcctg gaggtcctgg apgQttccgg agccgacgtg 420 

agcggcetge tcgccgacgc cggcgtgiific gcctcf gccg tcgaggtgac cagcagtgcg 480 

cagcccgagc tetac§ccga catcatcggc ggtctf gcct acaceatggg cggcc§ctgt B4Q 

tcggtcggat tcgcggceiac caacgccgcc ggtcagcccg gattegtcae egccggtcac 

tgtggcc^cg tgggcaccca ggtsagcatc ggcaacsicc agggcgtctfc cgagcagtcc 

atcttcecgg gcaacgacgc cgccttcgtc Ggcggcaegt ccaacttcac gctgaceaaG 72Q 

Paqe 20 



10495 . ST25 < txt 

ctggtcagcc gctacaacac cggcggttac gccaccgtcg ccggecacaa ccaggcgcxc 7SQ 

atcggctcct ccgtctgccg ctccggctcc accaccg«ct ggcactgcgg caccatccag 840 

gcccQcggcc agtcggtgag ctaccccgag ggcaccgtca ccaacatga*; ccggaccacc 900 

gtgtgcgccg agcccggcga ctccggcggc tcctacatet ccggcaacca ggcccagggc 960 

gtc&cctccg Qcggctccgg caactgccgc accggcggga ccaccttcta ccag§a§gtc 1020 

acccecatgg tpactcctg gggcgtccgt ctccggacct aa 1062 

<210> 37 

<211> 353 

<2I2> PRT 

<213> Nocari|iopisis praslna DSM 13648 

<220> 

<22I> PmPBP 

<2t2> ,0.>.. 



<220> 

<221> mac.p'Sptide 
<222> (166) . - (3SS}: 

<40a> 37 

Ala Thr Sly pro Um Pro &n Ser Pro Thr Pro &u Ala Asp Ala 
-165 ~im "ISS 

val S6i" Met Sin al« Ala teu Gth Ar^ Asp teu sly leu thr" Pro 
-ISO -145 -140 

L&a <slu Ala Asp (slu tm Lm Ala Ala Gin Asp Thr Ala Phe slu: 
-135 -^iM -125 

val Asp 6lu Ala Ala Ala Ala Ala Ala isiy Asp Ala Tyr <sty Gly 

Ssr val ??he Asp thr Glu Thr Leu sly Lsu Thr va1 Leu val Thr a 

Ala Ala S^r Val Glu Ala Val Ala Thr 61 y Ala <sTy Thr clu Leu 
"85 -SO -75 

val Ser Tyr Qly lie Glu Sly Lsu Asp Glu lie rils Gin asp Leu Asn 
-70 -SS -SO 

Ata Ala Asp Ala Val pro sly val va1 Sly trp Tyr pro Asp Val Ala 
-55 -SO -4S 

Gly Asp Thr Val yal teu Glu Val Lau Glu <aly Ser 61 y AT^ Asp ^al 
" -40 -fS -30 

ser Gly Leu Leu Ala Asp Ala .Sly Val Asp Ala Ser Ala val Glu val 

-20 " -15 -10 

Page 21 



10455 , 204 , ST25 - txt 
Thr ser ser Ala <5tf5 Fro «s1u teu Tyr Ala Asp lis lie €ly Gly Lsu 

jkta ryr Thr Met Gly 4ly Arg <Cys Ser Val ^ly Phe i^vla Ala Thr Asn 
10 15 " 20 

Ala ^Ta <3ly <Slo Pro sly Phe Val Thr Ala sly His fcys &y hrg v&l 
25 30 3S 

lly Thr Gl h Val Ssr Xle Gly Asn Sly &n i?1y Val Phe <Slu Sl« Sep 
40 4% m S3. 

lie Phe Pro Gly Asn Asp Ala Ala Phe Val Arg <sly Thr ser Asn Phe 
60 m 70 

Thr Lm Thr Asn teu val ssr Arg lyr Asn Thr Sly Gly lyr Ala Thr 
75 80 85 

Val Ala Gly His Asn Gln Ala Pro He Sly Ser ser val cys Arg ser 
90" 9S 100 

Gly Sisr Thr Thr «ly Trp His Cys <sly Thr Xle Glrs Ala Arg sly sin 
:ms liO lis 

ssr Val ser Tyr pro 61 u «ly Thr val rhr Asn Met Thr Arg thr rhr 
121) 125 130 Its 

Val Gys Ala Glu rro sly Asp Ser «ly sly ser tyr Xle ser Gly Asn 
140 145 150 

Sin €ln Gly Val Tftr ser Sly ^»ly ser sly Asn cys Arg Thr ety 
155 160 165 

sly Tlir Thr pM Tyr Gift val Thr Pro mt val Asn ser Trp Sly 
170 175 180 

val Ara isu Arg Tlir 

-<210> 38 
■<2i:l> 43 

<212> om 

<213> Artificial se<^«ehc«; 
<220> 

<I25> Prmur : 134| 
<4m> m 

gttcatcgat cgcategget ^ecaccggac cgctceccca gtc 43 

<aiO> 39 
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10495 ,204-»,ST25 . txt 

<212> DNA 

<213> Artificial sequsnee 
<21Q> 

Primm 1602 

<40G> 39 

gcggatccta ttaggtccgg agaeg gacgc cccaggag 3f 



<21.0> 40 

<;211>- 1062 

<2i2> mh 

<Z1S> Nocardiopsis prasina DSM 15649 

<400> 40 



gccaccggac 


cactccccca 


gtcacccacc ccggaggccg acgtcgtctc catgcaggag 


60 


gcgctccagc 


gcgacctcgg 


cctgaccccg cttgaggccg atgaactgct ggccgcccag 


120 


gacaccgcc.t 


tcgaggtcga 


cgaggf-cgcg gccQaggccg ccggtgacgc ctacggcggc 


180 


tccgtcttcg 


acaccgagac 


cctggaactg accgtcct^ig tcaccgactc cgccgcggtc 


240 




aggcc&ccgg 


cgccgggacc gaactggtct cctacOQcat cacgggcctc 


30Q 




tcgaggagct 


caacgccgcc gacgccgttc ccggcgtggt cggctggtac 


360 


ccggacgtcg 


cgggtgacac 


cgtcgtgctg gaggtcctgg agggttccgg cgccgacgts 


420 


ggcggcctgc 


t-cgccgacgc 


cggcgtggac gcctcggcgg tcgaggtgac caccaccgag 


4m 


ciujcecgagc 




catcatcgge ggtttggcct acaccatg^g cggecgctgt 


540 




tcgcggccac 


cascgccgcc ggtcagcccg ggttcgtcac; cgccggtcac 


600 




tgiggcacGca 


ggtgaccatc: ggcaacggcc sggfcgtctt cgagcagtec 


560 






cgccttcgtc egcggaacst ccaacttcac gctgaccasc 


720 




gct^caacac 


tggcggctac gccac¥:gtcg ccggtcacaa ccaggcgccc 


7S0 




ccgtctgccg 


tt^cSQCtcc acqaccggtt ggcactgcgg; eaccatccag 


840 




agtcggtgag 


ctaccccgag gg^accgtca ccsacatgac gcggacc^cc 


too 




^gcccggcga 


ctcxggcggc tcctaca:tct ccggGaacca ggccqagggc 


m& 


gtcacctccg 


gcggctccfg 


caactgccgc aGcggcggga fccatctttfta cicaggaggtc 


1020 




tgaactcctg 


gggcgtccgt ctccggacct aa 


1062 



<210> 41 

<2ia> ,353 

<212> PRT 

<2i3> Mocafdi^psis prasina DS^ XS649 



<220> 

<221> PROPEP 

<222> (1>.>C16S) 

<221> mt-^&BpxAcM 

<m> (166},. (351) 



1049S . 204- UfO. ST25 - txt 
Ala Thr Glv Pro teu Pro Sin Ser Pro Thr Pro «3ltJ Ala Asp Ala 



va1 ser Met Slrj Slu Ala ieu Gin Ars| Asp Leu Sly Letj Thr Pro 
-156 -:14S ' 



Ley Slu Ala Asp a1u teu L«u Ala Ala Girt Asp Thr Ala. Phe Glu 
-3:35 -X30 

Val Asp Si u Ala Ala Ala slu Ala Ala Gly Asa Ala Tyr Glv sly 
-120 -12.5 ^IM 

Ser val ^he Asp thr gIu Thr Leu Glu Leu Thr val Lew val Thr asd 
-105 -100 -3% -90 

ser Ala Ala Val 6Tu Ala Val <Slu Ala Thr Sly Ala Gly Ttir 6l« teu 
-8$ -to 

Val ser Tyr 6lv ll« iNr Gly Leu Asp <siu ile Val Slu gIu Laa Ash 

~?0 -55 -60 

Ala Ala Asp Ala Val f»m &1y vat Val 61 y trp xyr pro Asp val Ala 
-55 -50 --45 

<sly Asp Thr val val Leu <slu val teu ciu sly ser sly Ala Asp \fal 
-40 -35 -30 

Gly &y IM Leu Ala Asp <sly val Asp Ala ser Ala Val gTu val 
-2S -20 -IS -10 

Thr Thr thr Glu Gin Fro Slu Leu Tyr Ala Asp> i1e xle Gly sly Leu 
-S -11 ■ S 

Ala Tyr Thr mt c>Ty Gly Arg tys ser Val Gly Phs Ala Ala rhr ash 
10 15 20 

Ala Ala Gly Sin Pro Gly phg val rhr Ala sly His Cys Gly Are val 
25 30 35 

<sly Thr Gin Val Thr lie <3ly Asn €ly Arg <sly val PM s'lu Gin ssr 
40 4S SO 55 

lie phe Pro Gly Asn Asp Ala Ala Ph© Val Arg Gly Thr ser Asn Phe 

til r Leu Thr Asn Lm ml s«r Ars tyr Asn thr Gly el y tyr Ala Thr 
75 80 m 

Vkl Ala Gly His ASii sl« Ala Pro Tie Gly Ser ser val cms Am Ser 
SO 95 100 

Faa© 24 



. 204 -isiO , ST2S , txt 

Gly Ser Thr Thr sly irp Bis cvs gTv Thr rie sin Ala Am tsly sin 

i0s lie - ' ^ - 

Ser val ser Tyr Pro Si a «ly Thr vai Thr Asst Thr Am Thr Thr 
120 12S 130 BS 

val cys Ala Glu Pm Sly Mp Ser Gly <Gly Ser Tyr lie ser Sly Assi 
140 %4 iS'O 

Sin Ala <s1n Sly Val Thr s«r sly aly ser Gly Asn cys AFg Thr s1y 
155 160 ' 165 

<5ly Thr Thr ?»he Jyr 6lo <Slu Val Thr >fo mt Val Am Ser Trs <sTy 
170 175 180 

vat Arg Ley Arg Tbr 

<210> 42 

^2ll> 43 

<2'13> Artificial sequence 
<220> 

<223> f>r1m<er 1603 

^400> 42 

gttCatcgat cgestG§gct gecaceggac csfctcfeccca gtc 43 

<210> 43 

<211> 353 

<2:12> MT 

<213> fijocardlopsis sp. I^RRL 18262 
<220>- 

<221> PROPEF 

<2a2> CD . . C16S) 

<220> 

<221> ffsat„p«ptide 
<222> (166) . "< CiqS3) 

<400> 43 

Ala Thr Gly Ala Lsu 9m Gin ser pro Tfir pro 6l« Ala Asp Ala 
-MS "160 -iSS 

val Ser Met &n Gl« Ala Lets slfi Arft Asp Lm Aso km Thr^ ssr 
-ISO "145 -140 

*<la slu Ala «]y slis Leu tew Ala Ala Gin Asp thr Ala Ph& 6lu 
-135 -13Q -125 

Val Asp Slu Ala At® Ala alu Ala Ala Sly Asp Ala Tyr Sly Gly 
-120 -115 " -110 



104§5 , ST2S , txt 
Ser Val Fha Asp ihr Glu ssr Lea <Slu Leu Thr Va'l Leu val Thr Asp 

A'sa Ala Ai& Val Q]y Ala val Glu Ala T^r Gly Ala Gly Thr 61 u Lets 
-85 ~S0 ~7S 

Val Ser Tyr <5ly lie Asp &y Leu Asp €lu lie val Gin <STu Leu Asn 
■70 «6S 

Ala A'la Asp Ala Val Fro a1y Vat val sly Trp lyr pro asp val Ala 
-SS -45 

ss'^y ASD Thr Val Val im Olu val leu <a1u Sly Ser Ala Asp Val 
-40 ~3S ^30 

ser cly leu leu ATa Asp Ala sly Val asp Ala Ser Ala Val <3lu va1 
-25 -20 -is -10 

Thf thr ser Asp <aln Pro slu Le^u Tvr Ala Asp xlo lie Gly Gly Ley 

Ala Tyr rhr mt Gly <?ly Arg <Sys Ser Val Gly Pile Ala Ala Thr Asn 
10 15 20 

Ala Ala Sly Gin Pro sly Flio vsl Thr Ala sly His cys <siy Arg val 
is m 3S 

Gly Thr Gln Val Tfsr I Tie sty Ash Sl?^ Arg Gly Val Phe Glu Gin S«r 
m 4S SO SS 

Val Phe pro Gly Asn Asp Ala Ala Ph^ val Arg Glv thr ser Asn Phs 
SO 0% ' 70 

Thr tea Thr Asn Leu Val Ser Arg Tyr Aso Thr Q}y sly Tyr Ala Thr 
75 80 m 

Val Ala Gly Hjs Asn Gin Ala Pro lie Gly ser Ssr Va! Cys Arg ser 



Gly ser Thr Thr Gly Trp His cys fily Thr si e sin Ala Arg Gly Gin 
lOS im 115 

ssr Val Ser ryr Pro Gly Gly Thr val Thr Asn set Thr Ara Tlir Thr 
120 125 im 135 

Val evs Ala sly pro s1y Asp ser &y Glv ser Tyr lie ser Gly Thr 
MO Ml t^O 

sin Ala Gin e1y val Thr Ser <Sly Sly se.r sly Asn Cys Arg Thr Sly 

155 160 ISS 

Page 26 



10495 , 204~W,ST2S « txt 

qga att gat ggc ctt gat §aa att gtt eaa gaa etg aat gca get gat 414 
Gly lie Asp mv Lets Asp i1s Val <s1n alu Leu Asn Ala Ala Asp 

-70 " -65 -60 -55 

get gtt ccg ggc §tt gtt ggc t§g tat ccg gat gtt get gqa gat aca 462 
Ala Val ?m Gly val Val Sly Tr*j> lyr pro Asp Val Ala €ly Asp Thr 
-SO ^45 -40 

gtt gtG ctt gas gtt ctt SM gga tea ggc gea gat gtt tea gge ctg 510 
val Val Leu si a val l«u gIu sty Ser Gly Ala Asp v&l ssr sly leu 
-35 :t30 «:2S 

ctg gm gac sea s^a gtc gat gca tea gca gtt gaa gtt aca aca tea SS8 
Leu a1^: Aip Ala <3ly va1 Asp Ala ^Ala val slu val thr tisf Sfes? 

■ ■ -20 " ■ -is ■ ■ ■ 

gat caa cce gaa ctt tat qca gat att att ggc gac ctg gca tat tat 606 
Asp <5ln Pro 61 « Ley lyr Ala Asp He He Gsy sly Le« Ala Tvr Tyr 
-S -X 1 S ' 10 

atg gqc ggc aga tgc age gtt gqc ttt gca gca aca aat gca tea ggc 654 
Mst Gly Cslv Arg Cys Sar Val Gly Pha Ala Ala Thr Asn Ala Ssj" Gly 
IS 20 25 

caa cci ggc ttt gtt aca gca ggc cat tgc ggc aca gtt ggc. aca cca 702 
Bin: Prb Sly Ph© Val Thr Als Sly His Cys Gly Thr Val sly Thr Pro 
30 3S 40 

gtt tea att §^c. aat gsjc aaa ggc gtt ttt gaa cga age att ttt ccg 7SQ 
Val Ser lie Gly Asf? Gly Lys Gly Val Phs Slu Arg Ser l1s Rhe Pro 
4S 50 SS 

ggc aat gat tea gea ttt gtt aga ggc aca tea aat ttt aca ctt aca 798 
Gly Asn Asp ssr Ala i»he Val Arg Gly Thr Ser Mn Phe Thr L^u thr 
60 6S 70 

aat ctg gtt tea aga tat aat tea ggc ggc tat gca aca stt gca ggc 846 
Asn LeU Val Ser Arg Tyr Asn Ser Sly sly Tyr Ala Thr val Ala sly 
?S m 85 90 

cat aat caa gea eeg att ggc tea gca gtt tge aga tea g<|C tea aca 
His Asii slii Ala Pf-o lie Gly ser Ala Val Cys Arg Sfer g1v ser Thr 
SS 100 lOS 

aca gqc tqa cat tgc ggc aca att caa gca aga aat caa aca gtt agq 942 
Thr sly Tr& His cys <*iy Thr lie sin Ala Arg Asrs Gin ihr va1 Arg 
110 115 120 

tat ccg caa ggc aca gtt tat agt ctg aca aga aca aca gtt tgt gca 990 
Tyr pro sin sTy Thr val Tyr ser leu Thr Thr rhr val Cys Ala 
125 130 135 

gaa ccg ggc gat tea mc ggc tea tat att age ggc act caa gca caa 1Q3S 
slis Pro Gly Asp ser sly sly ser Tyir a:1e S^sr sly Thr sirs Ala Gin 
140 MS ISO 

ggc gtt aca tea ggc gge tea ggc aat tgc agt get ggc ggc aca aca X0$6 
G'iy Val thr Ser Sly my Ser sTy Asn cys Ser Ala Sly Gly Thr Tlir 
iSS 160 16S ' 



tat tac caa gaa gtt aat ccg atg ctt agt tea tgg ggc ctt aca ctt 1134 

Tyr Tyr Glfi slw Val Asn Pro Het Leu S^er ser Trp sTy Leu thr teu 

175 180 185 

asia aca caa teg eat gtt caa tee get cca 

Arg Thr Gin ser His val slii Ser Ala Pro 

im 19S 

Pags IS 



10435 . 204~».ST25 , txt 




4S 

38S 

PRT 



<213> Artificial sequence 



<I23> s>?fttl^etic eenstryet 
«40G> 45 

Met tys tvs Pro Lm <5ly lys lie val Ala ser Thr Ala Lm Leu 



lie Ser val Ala fhe sar Ser ser xle Ala ssr Ala Ala Thr (5"ly 
-175 -170 -165 



Ala Leu Pro Gin ser Pro Thr Pro filu Aia Asp Ala val S«r tViet 
-^MO -155 -ISO 



elri Glu Ala Leu Gin Arg Asp Leu Asp Lea Thr ssr Ala gIu Ala 
-MS -140 "135 



Glu slu Leu Lsu Ala Ala Gin Asp Thr Ala Pha Glu Val Asp &u 
■130 -lis -mi 



Ala Ala Ala Glu Ala Ala sly Asp Ala Tyr <siy Gly Ser Val Phe 

-115 -110 ~ias 



Asp Tliir Glu Ser Leu Glu Leu Thr va1 Leu ¥al Thr Asp Ala Ala Ala 
"100 -05 -90 



val Glu Ala val <slu Ala 11ir sly Ala Sly Thr val lsu val s&r fyr 
■8S -SO "/S 



Gly lie Asp Gly Lau Asa Glu iTe Val Gift Slu Leu Asn Ala Ala Am 
-M -65 ^60 -S5 



Ala vaT pro Sly Val val <sly trp tyr fro Asp Val Ala Sly Asp Thr 
-SO ' --45 ■ -40 



val Val Leu STu val Leu Glu Gly ser Gly Ala Asp Val ser Gly Leu 



Leu Ala Asf? Ala Gly val Asp Ala ser Ala val Glu val Thv Thr ser 
-20 ' -15 -IS 

Asp lain fro Glu Leu Tyr Ala Asp lie :ile -sly. Gly km Ala Tyr Tyr 
-3 ~1 1 3 10 

Met Gly Gly Arg cvs ssr val Gly Phe Ala Ala Tiir Asn Ala ser Gly 
" IS 20 25 

Page 29 



-30 



61« wo Gly Pha val r\w Ala Sly His cys Sly Tfer Val Sly Thf Pro 
~ 3a 35 " 40^ 

Val ser Xls Gly Asn sty Lys Gly Val phe €lu Am ser ite Phe Pfo 
45 S#" 55 

fily Asn Asjj Ser Ala Phe val Arg Gly Tl^r Ser Asn ihr Leu Tbr 

A&n Lm Val ser Arg Tyr Asn ser Giv €ly Tyr Ala Thr Val Ala <s1y 
?S Sd S3 90 

His Asn 6ln Ala Pro lie <3lv ser Ala val Gvs Arg s«r «ly ser Thr 

9s mo m 

Thr sly rrp His Cys €ly Thr Xle sin Ala. Arg -Asn 6.1 n Thr val Arg 
110 lis 120 

Tyr Pro Sin sly Thr Val Tyr Ser Leu Thr Arg Thr Thr Val Gys Ala 
125 130 13S 

Sl« Pro Gly Asp s«r sly <a1y ser Tyr lie ser sly thr Gin Ala sir? 
148 14S ISO 

Sly Val thr ssr 0ly Oly ser sly Asn cys ser Ala Sly Sly Thr thr 
1S5 160 im 170 

Tyr Tvr gIh slu Val Asn Prix Met Leu Ser Ser Trp &\y Lm itir Lm 
17S ISO 10 

Arg Thr Gin ssr His Val Sin Ser Als pro 

19Q 



<210> 46 

<211> 165 

<212> PRT 

■<2:0> Artificial sequence 

<m> 

<ZIt> Shuffled propi^ptii^s Q^2<19 

<22{)> 

<221>- PROPSP 

<222> Cl).,a.S§} 

<4a0> 46 

Ala Thr sly Ala i;eu Pro Sin ser pro Thr Pro Glu Ala Asp Ala val 
1 5 10 IS 

' Mst Sin Glu Ala L&u s1i> Am Asp km Asp teu Thr ser Ala 61 u 
20 25 30 



Ala Sl« Sly L®« Leu Ala Ala sin Asp Thr Ala Phe slu val Asp slu 

Page 30 



imm. 204 -W . ST23 , txt 

u m 45 

Ala Ala Ala Ala Ala Ala sly Asp Ala lyr sly Gly ser val Phe Asp 
SO S5 '60 

Thr Glu ser Leu Thr Lets Thr Val Leu Val Thr As» Ala Ser Ala Val 
6S 70 75 80 

Glu Ala val sslu Ala Ala sly Ala Glu Ala Lys val va1 ser His sly 
05 SO 9S 

net <3l« 6ly teu g1« <s1u Jle val Ala Mp l«u Asr Ala Ala Asp Ala 

100 los ;uo 

sin Pro Glv' Val Val Sly trp Tyr Pfo Asp xle His ssr Asp Thr val 
lis 1'20 12.5 

Val Lm gIu Val Leu ^aOi 61 y ser ^ly Ala Asp val Asp sef teu Leu 
130 135 140 

Ala Asp Aia Ssiy Val Asp Ala Sef Ala Val elu val Thr thr Ser AM 
145 150 IS 5 160 

Qln Prd Slu Leu TVr 

lis 

<210> 47 

<;211> 3.66 

<212> BRT 

<213> Artificial sequssice 

<220> 

<M3> shufflsd propeptide S~2.73 

<22Q> 

<m> cij..<xs6) 

<4m> 47 

Ala itsr s1y Ala Lm ^to Gin ssr Pro thr Fro ^Slu Ala Asp Ala Val 
15 10 IS 

$er Mst Sin «5iu Ala Lsy Gin Arq Asp i.iy Asp Leu S«r ssr Ala 6lu 
10 " 25 30 

Ala 61 u Glu L.eu Leu Ala Ala a1n Asp ihr Ala Ph® Glu Val Asp Slu 
3S 40 45 

Ala Ala Ala oly Ail a Ala Gly Asp Ala lyr Gly Gly Ser Val Phe Asp 
50 ' 55 60" 

Thr Glu Thr Leu Glu Leu Thr Val Lett Val ihr Asp Ala Ser Ala vsl 
&5 70 75 Sq 

Faqe 31 



10495 , 204~», ST2S , txt 

Stu Ala Vai <3lu AlA Ala Ala gIu Ala tys val Val ssr His sly 
85 90 95 

Met 61v' i-su Glu sslu lie Val Ala Asp L«u Asn A"la Ala Asp Ala 
100 lOS 110 

Sin Pro Gly val Val Gly Trp lyr Pro Asp lie His ser A«p Thr Val 
lis 120 125 

Val Val sly val lets clu sly set- Gly Ala Asp val Asp sef teu Leu 
130 135 140 

Ala Asp Ala «ly val Asp Thr Ala Asp val tys val alu. ser Thr jhr 
145; 15Q lis 160 

<5lu <Sln Pro Glu L«u Tyr 
165 

<210> 48 
;<21i> 166 
<212> PBT 

•iMSy Artificial sequence 

<i2d> 

<223> Shuffled fjropeptidS e~l,43 



<220> 

^221?- PROPER 
<22.2> (3.) . . C166} 

<400> 4S 

Ala Tl^r Gly Ala ueu Pm Glfi Ssr Pro Tbr Pro Gla Ala as£> Ala val 

1 5 10 is: 



Ser Met Gin sslu Ala Leu Gin Ar-q Asp L«y &'\y leu ser ser Ser sSlf* 
20 " 25 30 

Ala Glu Clu Leu Leu Asp Ala Gin Ala 6lu Ser Ph-s 6lu xle Asp Slu 
3S 40 45 

Ala Ala Ala Ala Ala Ala Glv Asp Ala Tyr Sly 0ly ser lie Phe Asp 
SO 55 60 

Thr. Asp Ser Leu Thr Leu ihr Val l^eu va:1 Thr Asp Ala ser Ala Val 
65 70 75 80 

alu Ala val (>lu Ala Ala <3ly Ala filu Ala Lys val val Ser His <Sly 
85 90 m 

js^et Glu Gly Leu Glu «1y lie val Ala Asp Leu Asn Ala Ala Asp Ala 
IDO 10$ iio 



10495 , 2M-WCi, ST2S ,.txt 

Gin Pro Gly val Val sly Trp Tvr ^ro Asp lie Bis ser Asp Thr val 
lis 120 OS 

Val Leu slu ya'l l&u gIu ^'iy ser Sly Ala Asp Val Asp ser i.cu Leu 
130 135 - 



Ala Asp Ala Gly Val Asp Thr Ala Asp val tys val <s1u Ser Thf Tltr 
145 150 15$ im 

al« (Sin Pfo slu teii Tyr 
X65 



<2X0> 49 

<211> 166 

<i;!i>j,> PRT 

<213> Artificial seqMSitce 
<22Q> 

<22i> Shuffled propeptida <S^2.6 

<400> 49 

Ala siy Ala Leii Pro Gin sef Pro tf^r pro slu Ala Asp Ala vstl 
1 5 10 15 

Ser H&t sirs slu Ala teu Gin Arg Asp Leu Asp L«u Thr ser Ala Gtu 
20 25 m 

Ala Slu Glu L^u Leu Ala Ala Asp Thr Ala Phe (Slu val Asp (Slu 
SS 40 45 

Ala Ala Ala a14 Ala Ala <STy Asp Ala tyr Gly &\y ser lie Pha Asp 
SO 55 60 

Thr Slu Thr Leu C^lu Lm rhr val Leu Val Thr Asp ser ssr ssr val 
6S 70 75 m 

Slu Ala Val a1u Ala Ala Gly Ala Glu Ala Lys val val Ser His Gly 
85 M tS 

Met Glu Gly Leu Glu Slii lie val Ala Asp Leu ASR Ala Ala Asp Ala 
100 105 110 

Gin Pro Gly val val Gly Tfp Tj^f Pro il& Mis ser Asp rhr Val 
X.X^ [3«^0 aU^Si' 

val leu Glu Val Leu slu Cfy Ser Sly Ala Asp val Asp sfer L-gu Leu 
130 135 



Ala ^ily Ala Gly val As» Thr Ala Asp Val tys val Glif Ser Thr Thr 
145 150 1S5 IgO 

61 o Gin Pro slu Ley TV r 



1049S - 204"%«S. ST2S , txt 

L6S 



<210> 50_ 

<21i> 165 

<IIZ> PRT 

<2'13> Artif'seial sequence 

<22Q> 

<223> smff\M pmpeptide <&~2.3 
<120> 

r<223> PROPER 

<222> CD . < (165) 

<400> SO 

Ala Thr Gly Ala teu Pro Gin ser Pro rhr Pro Glw Ala Asp Ala Val 
1 ' S JO 15 

Ser i^iet Gin Glu Ala Ley Gin Arg Asp L&u Gly Lm Thr Pro Lau sly 
20 25 30 

Als Glu slu Le« Leu Ala Ala Gin asp thr Ala fhe alu val Asp olu 
35 40 45 

Ala Ala Ala 5lu Ala Ala sly Asp Ala Tyr Sly aly ser val Phe asp 
50 55 m 

THr Qlu Thf* Lfeu Gla Leti Thr val t^u val Thr Asp Ala sfej' Ala val 
65 70 ?S SO 

<a1u Ala val Slu Ala Ala sly Ala 4M Ala tys val Val ser His Sly 
as m 95 

Met Slu Sly Leu Sl« sin ils Val Ala Asp Lew Asrt; Ala Ala Asp Ala 

im 110 



feln Pro Glv val val <*1y Tfp Tyr Asp lis Nis ser Asp thr val 

12S 



Val Leu Glu Val Leu ialu Gly S&r Sly Ala Asp Va1 Asp Ser Lm km 

im 13S im 

Ala Asp Ala Gly val Asp Ala Ser Ala val Glu Val Thr prs Ala Ala 
145 150 155 1S0 

Arg Pro Gl« Leu T^r 
<2io> Si 

■<21I> 166 
<2;12> FRT 

^213> Artificial s«c{uence 



pa§e 34 



10495. ST2S , txt 



<223> 



Shyffled propeptide 




(1) > >€im) 



<400> 51 

Ala Thf Gly Ala Ley Pro (Sifj $«r Pro Thr Pro iSsSp <3ly Ala Slu 
1 S m IS 

Thr Thr Met Va1 slu Ala tsii Sin Arg Asp teu sly im Tfir prtj Ala 
2G 2S 30 

6Tu Ala Slu slu teu tey Ala Ala sin Asp Thr Ala Phe Slu Va1 Asp 



Glu Ala Ala A1a Ala Ala Ala aly Asp Ala Tyr <»ly Gly Ser lie f*he 
5S S5 SO 



Asp thr Asp Ser Leu Thr Leu thr val leu V^sT thr Asp Ala Ala Ala 
65 70 7S 80 



val {»1u Ala val 61 u Ala Ala sly Ala ^Slii Ala Lys Val Val Ser Hts 
S5 90 9S 



6ly Met G)u Sly Leu gTu:<s1« lie Val Ala Asp ley Asrt Ala Ala Asp 
108 iS5 liO 



aM Vat Pro Giy Val Val Sly Trp Tyr Pre* Asp Val Ala Gly Asp Thr 
lis 120 * 12S 



val Val Leu slu Vis 1 leu <ski Gly ser Gly Ala Asp Val Tyr Ser Leu 
130 135 * I40; 



Leu Ala Asp Ala Gly val Asp Ala Ser Ala Val slu val Thr pro Ala 
14S ISO 153 160 



Ala Gl»i pro slu Leu Tyr 
16S 



<2iO> 52 

<21i> 166 

<ziz> mr 

<213> Af*tif1c1a1 se^iuence 

<22Q> 

<223> shuffled propeptide <a-i,4 



40 



45 




PROPEP 
CD . . (166} 



<400> 52. 



Fags 



1O40S . 204~WO, ST2S . txt 
Ala thr Gly Ala Leu pjco <Sin Ssr f>m Thr Pro <Sly Ala Asp Ala val 
1 5 10 IS 

ser Met- sin clu Ala Leu sin A-rq Asp im sly tey ser ser ser alo 

20 "25 30 

Ala slii Ghi i,e« im Asp Ala ^In Ala. g1u ser Fhe slu lie Asp <slu 
35 49 4:5 

Ala Ala Ala Ala A1 a Ala Ala As|) ser Tyr sly <3ly ser lie Pba A&p 
50 SS 60 

Thr Asp Ser Leu Thr Leu Thf Val Leu Val Thr Asp Ala sar Ala val 
05 70 ?S go 

6lu Ala Val Glu Ala Ala Gly Ala slu Ala tys Val val s«r His Sly 
85 SO 95 

mt «lu Gly Leu Qlu Glu lis val Ala Asp Leu asji Ala Ala Asp Ala 
100 10s no 

Sl« Pro Gly val val Gly Trp Tyr pm Asp lie His ser A^p rftr Val 
tXS MQ 12s 

val Leu Glu val Leu Glu «ly S^r Gly Ala Asp Val Asjs Ser teu 
130 3.35 



Ala Asp Ala ialy Val Asp Thr Ala Asp val Lys val 6lu Ser Thr Thr 
145 ISO XS5 1$0 

61 « <aln Pro <3ly Leu Tyr 
165 

<210> S3 
<2M> PRT 

<213> Arti f 1 ci al sequence 
<220> 

<2M> skuffl&d propeptide 

<220>- 

<223> PROPEP 
<222> Cl) • ^(.166} 

-<400> S3 

Ala Thr Sly Ala Lea Pro Gl« ser Fro Thr Pro JSlu Ala Asp Ala val 
1 S 10 15 

ser Met <Siri Slu Ala L«u <s1h Am Asp im Asp Leu Thr Ser Ala Glu 
20 2S 30 



Ala <*tu sly Leu iLeu Ala Ala &n Asp Thr Ala Phe Glu Val Asp Sly 

Paga 36 



3S 



3495 , 204--». ST2S . txt 
45 



Ala Ala Ala. Ala Ala Ala «ly Asp Ala Tyr ely €ly ssr lie Fhe A&p 
5d 55 60 

Thr <Slu Thr Lm Qlu lsu Thr Val leu Val Thr Asp sef Sgf ssr val 
m 70 7$ gp 

Glu Ala val <>1y Ala Ala cly Ala slu Ala Lys val Val Ser His Sly 
8S M 9S 



Met Glo <Slv' teu Glu slu tie Val Ala 
100 105 



Leu Asri Ala Ala Asp Ala 



Gin Pro Gly Val Val Giv Trp Tyr Pro Asp lie His ser asp thr Va1 
lis 120 125 



Val Leu Glu Val Leu 6lu 61V ser <3ly Als Asp val Asp ser Lieu i;su 



140 



Ala Gly Ala sly val Asp Thr Ala Asp va:l Lys Val 6lu ser thr thr 
145 ISO ISS 160 



-gTu 6ln Pro 6lu Leu Tyr 
IB 
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